AD-A061  656 
UNCLASSIFIED 


AEROSPACE  CORP  EL  SE6UND0  CALIF  ENGINEERING  SCIENCE  —ETC  F/6  9/6 
TRANSFORM  IMA6E  COOING. (U) 

MAT  76  A 6 TESCHER  F06701-77-C-0078 

TR-0078( 3678-301-1  SAMS0-TR-78-127  NL 


1 


-1 

1 OF  2 

«8ei  666 

■ 

« S 1 M | 

l~  - 

JJk  K. 

■ 

ra 

\ 



■ 

n 

Jti 

DOC  FILE  COPY'  AD  AO  61  656 


REPORT  SAMSO-TR-78-1 27 


Transform  Image  Coding 


Prepared  by  A.  G.  TESCHER 

Electronics  and  Optics  Division  - 
MSngineering  Science  Operational^ 
The  Aerospace  Corporation 
El  Segundo,  Calif.  90245 


May  1978 


Final  Report 

(1  April-30  November  1977) 


APPROVED  FOR  PUBLIC  RELEASE; 
DISTRIBUTION  UNLIMITED 


THE  AEROSPACE  CORPORATION 

Prepared  for 

SPACE  AND  MISSILE  SYSTEMS  ORGANIZATION 
AIR  FORCE  SYSTEMS  COMMAND 
Los  Angeles  Air  Force  Station 
P.O.  Box  92960,  Worldway  Postal  Center 
Los  Angeles,  Calif.  90009 


This  report  was  submitted  by  The  Aerospace  Corporation,  El  Segundo, 
CA  90245,  under  Contract  F04701  - 77-C-0078  with  the  Space  and  Missile 
Systems  Organization,  P,  O.  Box  92960,  Worldway  Postal  Center, 

Los  Angeles,  CA  90009.  It  was  reviewed  and  approved  for  The  Aerospace 
Corporation  by  A.  J.  Boardman,  Electronics  and  Optics  Division,  and 
J.  R.  Henry,  Satellite  Systems  Division.  Lt  Col  D.  A.  Widen,  SAMSO/YD, 
was  the  project  engineer. 

This  report  has  been  reviewed  by  the  Office  of  Information  and  is 
releasable  to  the  National  Technical  Information  Service  (NTIS).  At  NTIS, 
it  will  be  available  to  the  general  public,  including  foreign  nations. 

This  technical  report  has  been  reviewed  and  approved  for  publication. 
Publication  of  this  report  does  not  constitute  Air  Force  approval  of  the 
report's  findings  or  conclusions.  It  is  published  only  for  the  interchange 
and  stimulation  of  ideas. 


D.  A.  Widen,  Lt  Col,  USAF 
Project  Engineer 


FOR  THE  COMMANDER 


1 


5SIFICATION  OF  THIS  PAGE  Date  Entered) 


I SAMSO/-TR- 78  - 127  / 


■ 


title  c«id  Submit) 


TRANSFORM  IMAGE  CODING 


7.  AU  TmOR( ») 


Andrew  G./Tescher 


9 PERFORMING  ORGANIZATION  NAME  AND  ADDRESS 

The  Aerospace  Corporation 
El  Segundo,  Calif.  90245  ' 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


TR- 00  78(3478  - 30)- 1 / 


10.  PROGRAM  ELEMENT.  PROJECT,  TASK 
AREA  4 WORK  UNIT  NUMBERS 


It.  CON  TROLLING  OFFICE  NAME  AND  ADDRESS 

Space  and  Missile  Systems  Organization  / // 

Air  Force  Systems  Command  — / 

Eos  Angeles,  Calif.  90009 


U MONITORING  AGENCY  NAME  h AODRESSf//  different  from  Controlling  Office)  15.  SECURITY  CLASS,  (ol  thla  report) 

.7  A^r  Unclassified 


J2.  reboot  BATE — 

' May 


13.  NUMBER  OF  PAGES 
112 


15  a.  DECLASSIFICATION/ DOWN  GRADING 
SCHEDULE 


IS.  DISTRIBUTION  STATEMENT  (ol  Ihlt  Report) 


Approved  for  public  release;  distribution  unlimited. 


17.  DISTRIBUTION  STATEMENT  (ol  tho  ebetrect  entered  In  Block  20,  II  dlllmront  trom  Report) 


19  KEY  WORDS  (Contln 


aide  It  nacaaamry  and  Identity  by  block  number) 


Communication  technology 
Data  compression 
Image  coding 
Image  compression 


Image  processing 
Transform  coding 


20vv  ABSTRACT  (Continue  on  reverae  aide  It  nacaaamry  and  Identity  by  block  number) 

In  recent  years,  considerable  research  effort  has  been  done  to  optimize 
various  digital  communication  channels.  The  significant  growth  of  both 
commercial  and  military  communication  systems  has  emphasized  the  need 
for  efficient  data  compression  procedures.  In  the  case  of  image  trans- 
mission, the  requirement  is  pa  rticula  riy  demanding  because  of  the  significant 
amount  of  information  to  be  transmitted.  - — 


UNCLASSIFIED 

SECURITY  CLASSIFICATION  OF  THIS  PAGE  (When  Dote  EnlAr.rf) 


SCCURITy  CL  ASSI A (CATION  Of  TMIJ  PAgCfWftan  Pmlf  Kntrrt4) 


I*  KEY  (OKOSlCiinllnuad) 


ABSTRACT  (T<uilfnu«ri> 

In  llus  report,  imago  nimprcssion  procedures  based  on  transform  toih 
tuques  are  reviewed.  The  emphasis  is  on  adaptive  and,  especially,  on 
rate  adaptive  algorithms. 

The  discussion  includes  an  historical  review,  theoretical  development, 
illustrative  examples  of  the  transform  image  coding  field. 


SfCllAITY  CLASSIFICATION  Or  THIS  PAO*fWH»n  Pair  Knf.ryrfi 


PR K PACK 


The  assistance  of  several  individuals  in  tin-  preparation  of  this  report 
is  greatly  appreciated.  The  suggestions  of  Professor  William  K.  Pratt  of 
the  Image  Processing  Institute,  University  of  Southern  California,  were 
most  helpful.  Ms.  Carolyn  Modeller  provided  expert  editoiial  assistance. 

Ms.  Peggy  Ritter  t v pod  the  earlier  versions  ot  the  manuscript.  The  encourage 
men!  from  The  Aerospace  Corporation  and.  in  particular,  from  Mr.  Charles 
.1 . Tcontis  to  undertake  this  effort  has  been  most  welcome. 


llus  report,  in  somewhat  different  format,  is  concurrently  being  published 
hv  the  Academic  Press.  Inc  . New  York.  N.  Y.  . as  Chapter  •(  of  Image 
Transmission  I'echniuuos  (W.  K.  Pratt,  editor). 


CONTENTS 


INTRODUCTORY  CONCEPTS  7 

1.  Image  Coding  Motivations 7 

2.  Theoretical  Backgrounds  8 

3.  Practical  Constraints  10 

4.  Implementation  Concepts  13 

5.  Communication  Models  15 

IMAGE  MODELS 18 

1.  Generalized  Imaging  System 18 

2.  Video  Models 23 

3.  Stationary  and  Nonstationary  Models 24 

4.  Vector  Models  29 

5.  Image  Quality 29 

TRANSFORM  CODING 31 

1.  Basic  Concepts 34 

2.  Review  of  Classical  Techniques 35 

3.  Image  Quality  Degradation  48 

4.  Tradeoff  Between  Imaging  Design  and 

Compression  System 50 

5.  Fundamental  Limitations  of  Nonadaptive 

Transform  Coding 51 

ADAPTIVITY  CONCEPTS 53 

1.  Nonstationary  Image  Model  Considerations  53 

2.  Communication  Model  as  a Practical 

Constraint 54 

3.  Types  of  Adaptivity 55 

4.  Buffering  Concepts  59 


tMOBll*®  F** 


CONT ENTS  (Continued) 


4 


KI(  u:r  i :s 


1.  "Typical  Distortion  (D)  vs  Rate  (R)  Curves  . . 

2.  Gene rali/.ed  Imaging  System 

3.  Recursive  Image  Modeling  

4.  Demonstration  o£  Nonadaptive  Processing  . . . 

5.  Schematics  of  Transform  Coding 

6.  Demonstration  of  Symmetry  for  Cosine 

Transform 

7.  Quantizer  Demonstration  (Tescher,  1973)  . . . 

8.  The  NASA/Ames  Compression  System 

(Knauer,  1975)  

9.  Variable  Rate  Coding  System 

10.  Variable  Rate  Coding  through  Prebuffering  . . 

11.  Demonstration  of  Image  Quality  Discontinuity 

with  Prebuffering  

12.  Variable  Rate  Coding  through  Postbuffering  . . 

13.  Adaptive  Transform  Coder 

14.  Subblock  Classification  I3ased  on  Energy  (Chen 

and  Smith,  1976) 

15.  Flow  Diagram  of  an  Absolute  Classification 

Procedure  (Cox  and  Tescher,  1976)  

16.  Buffer  Feedback  Logic 

17.  Demonstration  of  "Typical"  Buffer  Feedback 

Behavior 


5 


A.  INTRODUCTORY  CONCEPTS 

Transform  coding  was  invented  in  the  late  1960s.  It  has  been  close  to 
ten  years  since  various  formulations  of  transform  coding  algorithms  have 
been  utilized;  however,  practical  implementations  only  recently  have  been 
conside  r ed. 

The  primary  purpose  of  this  chapter  is  to  review  the  progress  to 
date  on  transform  image  coding.  The  discussion,  in  part,  attempts  to 
present  the  progress  in  the  transform  coding  through  an  historical  review. 

In  addition,  a critical  review  of  this  field  is  provided.  A transform  coding 
algorithm  requires  a relatively  complicated  implementation.  Thus,  impact 
on  hardware  design  also  must  be  considered.  In  addition  to  reviewing  the 
field  as  it  exists,  an  attempt  is  made  to  extrapolate  into  the  future  regard- 
ing potential  implementations.  By  necessity,  the  opinion  presented  here  is 
a personal  hypothesis. 

Because  of  the  complicated  nature  of  transform  coding,  it  is  necessary 
to  demonstrate  that  potential  implementations  are  likely  to  be  of  significant 
advantage.  In  this  introductory  section,  theoretical  background  appropriate 
to  transform  coding  is  reviewed.  The  formalism  is  also  developed  which 
permits  the  analysis  and  critical  review  of  this  technology. 

1.  IMAGE  CODING  MOTIVATIONS 

When  a complex  procedure  such  as  transform  technique  is  considered 
for  data  compression  and,  specifically,  for  image  coding,  it  is  important  to 
trade  off  hardware  complexity  with  potential  benefits. 


preceding  page  rlank 
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Transform  techniques  are  more  complex  than  other  conventional  and 
classical  data  compression  algorithms.  One  should  expect  excellent  perfor- 
mance given  the  complexity  of  transform  algorithms.  It  will  be  demonstrated 
that,  for  nonadaptive  transform  technique  implementations,  only  small  future 
advances  are  expected.  Further  additional  advances  are  expected  primarily 
in  adaptive  techniques. 

2.  THEORETICAL  BACKGROUNDS 

Since  several  excellent  reviews  on  transform  coding  concepts  are 
available  (Huang,  1972;  Huang  and  Tretak,  1972;  Huang,  et  al,  1971; 

Pearson,  1975;  Schreiber,  1967;  Wintz,  1972),  only  a short  discussion  is 
given  here.  In  particular,  Wintz  provides  concise  explanation  on  the  rele- 
vant theoretical  backgrounds. 

First,  a definition  of  transform  coding  is  given.  The  data  com- 
pression technique  transform  or  block  coding  is  a procedure  where  a set  of 
source  elements  is  coded  as  a unit.  The  term  "transform"  indicates  that 
the  original  set  of  elements  is  first  processed  by  an  invertible  transforma- 
tion prior  to  encoding.  Motivation  for  this  procedure  is  provided  by  the 
statistical  characterization  of  typical  pictures. 

Several  recent  papers  have  analyzed  the  theoretical  concepts  associated 
with  transform  coding  (Jain,  1976).  However,  the  original  paper  by  Huang 
and  Schultheiss  (1963)  is  still  an  excellent  basic  reference  for  a description 
of  the  advantages  of  certain  transformations  prior  to  encoding.  The  authors 
demonstrate  that,  for  a correlated  Gaussian  source,  optimum  encoding 
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consists  of  two  stops.  1 host*  arc  the  decor re lation  process  followed  l>y  the 
application  ot  a memory  less  quanti/.er  on  the  output  ol  the  t ru  ns  t o r i iui  t ion. 

It  tlu-  Gaussian  assumption  van  he  extended  to  intake  sources,  then  the 
optimum  encoding  should  always  first  in\  olve  a deco  r re  Lit  ion  procedure, 
for  Gaussian  sources,  dev'  or  r e lat  ion  implies  statistic.il  independence 
( 1 ’apoulis  . 1 d t>  “> ) . i onseque nt  1 v . the  tie v o r re  1 .i t oil  samples  p ro v id o informa- 
tion only  about  themselves. 

I ho  optimum  transform  tor  decorrelation  has  also  been  discussed 
extensively  m the  literature,  and  it  is  usually  identitied  as  the  Kurhunen- 
l.oeve  (K-L)  or  Hoteling  transformation  (llabibiand  Wint/. , 1971).  Thus, 
the  early  concepts  indicated  the  desirability  of  a decor  relating  transform 
and  provided  motivation  for  transform  coding. 

Other  coding  aspects  can  be  related  to  the  relationship  between  com- 
pression rate  and  distortion.  Again,  the  available  literature  is  quite  exten- 
sive. 1'  he  primary  and  obvious  cons  ide  ration  is  that  transform  coding 
introduces  a distortion  which  is  a function  ot  the  desired  compression  rate. 

1 he  understanding  ot  the  tradeoff  between  rate  (R)  and  distortion  (P)  w ill 
lead  to  practical  procedures . i.  e.  , tor  an  at  least  locally  stationary  pro- 
cedure, the  relationship  between  rate  and  distortion  leads  to  useful 
algorithms.  It  is  demonstrated  that  the  rate- versus-distortion  formalism  is 
fundamental  in  the  proper  design  ot  a highly  adaptive  coding  system.  1 he 
appropriate  formalism  is  developed  as  needed  tor  the  appropriate  adaptive 
techniques. 
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3.  I ’RAC  I'lCAL  CONSTRAIN  IS 

In  order  to  establish  a framework  within  which  several  different  and 
possibly  competing  designs  can  be  compared,  it  is  necessary  to  define 
certain  evaluation  criteria.  It  is  attempted  here  to  present  a somewhat 
simplified  but  practical  demonstration  of  various  theoretical  information 
concepts.  t he  concepts  are  presented  as  subsequent  aids  for  tradeoff 
analyses  of  various  designs.  The  following  three  primary  quantities  or 
parameters  art'  considered:  cost  (C),  distortion  (D)f  and  rate  (1\). 
for  example , in  Figure  1 several  curves  are  shown  which  are  represen- 
tative of  the  typical  functional  relationships  among  the  three  parameters. 

Of  these  quantities,  only  the  rate  is  well  defined;  it  will  always  refer  to 
the  appropriate  channel  operation  rate. 

The  term  "distortion  ' is  somewhat  ambiguous.  Frequently,  this 
parameter  is  defined  as  the  mean  square  error.  In  subsequent  discussions 
in  this  chapter,  the  distortion  parameter  provides  the  mechanism  by  which 
the  rate  is  controlled. 

The  "cost''  is  difficult  to  define.  It  can  be  defined  in  several  different, 
although  related,  ways.  Cost  may  imply  financial  investment  in  the  design 
and  hardware  complexity.  In  many  cases,  an  adaptive  technique  by  itself 
may  not  be  too  complicated.  However,  the  required  channel  protection 
necessitates  added  complexity  which  must  be  factored  into  the  overall  design. 

i 

Even  without  the  clear  definition  of  the  distortion  and  the  cost  parameters, 
the  behavior  shown  in  Figure  1 schematically  indicates  the  potential 
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tradeoff  among  the  three  fundamental  parameters:  channel  capacity, 
image  quality,  and  the  complexity  ot  the  design. 

To  be  meaningful,  the  distortion  measure  should  be  related  to  image 
quality.  In  general,  reasonable  correlation  exists  between  distortion  and 
what  appears  to  be  a perceived  image  quality  measure,  based  on  the 
general  behavior  of  the  presented  curves,  several  conclusions  may  be 
drawn.  Regardless  of  cost,  w ith  coding  at  a fixed  rate,  the  attainable 
distortion  or.  equivalently,  image  quality  is  restricted  to  a certain  range. 

In  other  words,  only  limited  gain  is  achieved  l > v going  to  more  complex 
de signs . 

Kquivalently,  for  a desired  image  quality  and  rate',  the'  design  may  be 
rather  expensive.  It  suffices  to  say  that  the  ce'st  aspects  are  complex  and 
numerous.  A fair  portion  of  cost  analysis  to  date  is  either  of  questionable 
value  because  of  the  rapidly  changing  hardware'  technology  or  irrelevant 
because  of  the'  context  in  which  it  was  made*. 

For  example,  in  transform  ending,  it  is  necessary  to  reformat  t he- 
image*  into  blocks,  inasmuch  as  the1  normal  ordering  ot  picture*  elements 
(pixe'ls  or  pe'lsl  follows  the  conventional  raste'r  scan.  I he  input  re'quire- 
ment  by  the-  transform  coder  necessitates  a block  ordering.  1'his  con- 
sideration is  unimportant  for  a video  type'  implementation.  However,  the* 
appropriate'  memory  consideration  for  reformatting  becomes  a major  e'ost 
factor  in  the'  case  of  large  si/e'  image  formats. 

Technology  advances  in  terms  of  spee  l.il  purpose  analog  devices  and 
getu'ral  eligit.il  equipment  have'  e reateel  a sign  die  ant  eTiange  m the'  attitiule'  e>t 
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designers  as  to  what  is  reasonable  to  compute  in  a practical  device.  It  was 
only  a few  years  ago  that  implementation  of  a sinusoidal  type  of  transform  for 
a practical  device  was  totally  unacceptable,  primarily  on  the  basis  of  cost. 

But,  based  on  current  technology,  the  same  computation  may  represent  only 
a fraction  of  the  cost  associated  with  complicated  coding  equipment.  The 
complexity  associated  with  reformatting  is  probably  more  demanding  for 
large  size  image  formats. 

4.  IMPLEMENTATION  CONCEPTS 

Up  to  this  point,  the  design  constraints  introduced  were  in  terms  of 
basic  parameters  of  the  coding  process.  It  is  important  also  to  consider 
building  blocks  of  the  coding  device  from  the  standpoint  of  both  general 
availability  and  hardware  complexity.  In  general,  the  following  device  con- 
siderations apply:  memory,  arithmetic  operations,  control  units,  and 
auxiliary  operations.  Superficially,  these  concepts  also  would  be  utilized 
for  the  design  of  a computer. 

The  similarity  is  not  really  surprising.  An  image  coding  algorithm 
implemented  in  the  digital  domain  is  functionally  a special  purpose  computer. 
However,  fundamental  laws  of  information  theory  also  explicitly  apply  for  a 
data  compression  device.  The  previous  schematic  curves  of  Figure  1 are 
the  result  of  the  appropriate  constraints  of  information  theory.  Thus, 
despite  the  willingness  to  apply  high-speed  arithmetical  operations,  the  poten- 
tial performance  gain  in  additional  compression  may  r.ot  be  too  significant. 

On  the  other  hand,  basic  concepts  associated  with  the  compression  device 
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can  ho  functionally  described  m terms  ot  computer  terminology  such  .is 
moniory  m.ina^cmt'iit  and  a ritlunct  u opo  rations. 


Another  general  concept  of  significant  importance  is  the  relative  com 
plexity  lietwecti  ttie  compressor  and  the  expander.  For  most  implementa- 
tions, the  compressor  is  likely  to  tie  more  complicated  than  the  decoder, 
file  reasons  are  tundamental.  1' he  decoder  is  slaved  to  the  coder;  it  can 
make  no  independent  decisions.  Conversely,  the  coding  device  has  not  only 
the  same  information  available  to  the  decoder,  it  has  access  to  the  original 
source.  Consequent  ly . it  has  the  option  ot  making  decisions  based  on  the 
availability  of  the  original  image. 

I he  distinction  in  complexity  between  coder  and  decoder  is  not  par- 
ticularly significant  for  nonadaptive  algorithms.  It  becomes  rather  impor- 
tant for  adaptive  procedures.  For  more  complex  and  sophisticated  com- 
pression schemes,  the  coder  is  likely  to  be  s ignif  icant  ly  more  complex  than 
the  decoder.  Although  this  observation  is  useful  for  the  design  of  image 
coding  systems,  it  is  unfortunate  that,  for  most  difficult  image  coding 
problems,  the  coder  is  more  constrained  than  the  decoder.  l he  usual  con- 
straints are  hardware  utilization,  power  consumption,  etc.  1 wo  obvious 
examples  are  satellite  applications  such  as  Fandsat  and  the  remotely  piloted 
vehicle.  For  these  applications,  the  coder  complexity  is  severely  limited. 
Conversely,  the  decoding  process  can  allow  tor  relatively  greater  complexity 
(ll.il)ibi,  197  5a;  llabibi  and  Snmulon.  1975). 
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In  terms  of  an  overview,  an  attempt  is  made  in  this  chapter  to 
categorize  various  actual  and  potential  designs  via  the  introduced  concepts. 
Specifically,  memory,  arithmetical  operation,  and  complexity  of  controller 


functions  are  considered. 

S.  COM  Ml' NIC  AT  ION  MODELS 

rhe  various  concepts  are  introduced  as  they  relate  to  image  compres- 
sion. It  is  important  to  realize  that  the  concepts  must  be  consistent  with 
the  fundamental  constraints  of  the  communication  system.  Image  coding  is 
a source  coding  procedure  (Gallager,  1968).  However,  the  source  coding 
procedure  also  must  be  acceptable  to  a realistic  communications  environ- 
ment (Shannon,  Shannon  and  Weaver,  1999).  Ihese  considerations 

are  likely  to  become  important  for  specific  implementations.  Virtually  in 
all  cases,  the  compressed  data  will  be  transmitted  through  a fixed  capacity 
channel.  For  a nonadaptive  data  compression  system,  no  unusual  problem 
exists.  However,  for  a variable  rate  compression  system,  a major  con- 
straint is  introduced.  In  Section  D.  this  problem  is  analyzed  in  detail. 

Another  problem  area  involves  real-time  image  coding  application 
jointly  with  voice  transmission.  An  obvious  example  is  the  picture /phone 
type  application.  Here,  the  constraint  is  that  not  only  the  communication 
system  must  operate  in  real  time,  but  unreasonable  delays  relative  to  voice 
transmission  are  not  acceptable.  1 his  consideration  is  important  whenever 
the  expected  response  from  the  receiver  is  near  instantaneous  after  the 
source  completes  the  generation  of  its  information. 

IS 
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Although  transform  coding  techniques  applicable  to  interlrame 
applications  are  few,  the  previously  introduced  concepts  imply  practical 
•constraints.  In  picture/phone  application,  three-dimensional  transform 
coding  cannot  be  applied  to  more  than  a tew  frames.  It  the  coder  simul- 
taneously operates  on  several  frames,  an  inherent  delay  in  transmission 
with  respect  to  voice  will  develop.  1 lie  same  constraint  does  not  exist  in 
a television  broadcast  application  where  a constant  time  delay  is  unimpor- 
tant. For  variable  rate  techniques,  the  required  buffering  develops  into  a 
valid  and  realistic  constraint. 

Channel  noise  is  a subject  that  also  deserves  comment  within  this 
introductory  section.  However,  it  is  by  design  (based  perhaps  on  a personal 
bias  of  tbis  author!  that  the  source  coding  problem  is  handled  separately 
from  the  channel  coding  problem.  1 lie  latter  problem  is  to  maintain  reliable 
communication  through  a noisy  channel.  Noise  immunity  or  the  lack  ot  it 
m certain  source  encoding  procedures  is  of  some  concern.  Hut  it  is  a 
rather  unreasonable  approach  to  require  that  a source  encoding  procedure 
be  useful  and  operational  in  a noisy  environment.  However,  in  an  overall 
communication  design  problem,  the  channel  coding  problem  shouUi.be 
c a r e f ul  1 y a na  l y / e d . 

basically,  the  following  philosophy  is  offered.  First,  one  should  design 
an  optimum  source  encoding  procedure.  Next,  the  developed  design  should 
be  appended  or  modified  to  be  consistent  with  the  need  for  channel  error 
protection  or  detection  procedures.  This  philosophy  is  likely  to  be  reason- 
able for  most  source  coding,  particularly  for  transform  coiling  procedures. 


Transform  coding  techniques  are  more  complex  than  most  classical 
source  coding  procedures  operating  on  individual  picture  elements.  lhus, 
although  additional  hardware  is  involved,  the  incorporation  of  channel 

I 

coding  increases  the  overall  hardware  complexity  only  slightly.  Several 
studies  analyzed  the  inherent  noise  immunity  of  several  source  coding 
techniques  including  transform  coding.  these  cons ide rations  are  basically 
limited  to  nonadaptive,  fixed  rate  techniques  (Pratt,  et  al . , 1974). 

For  adaptive,  variable  rate  techniques,  additional  considerations  should 
be  followed  when  an  actual  communication  system  is  designed.  One  approach 
is  to  eliminate  bit  errors  via  the  appropriate  channel  coding  implementation. 

T he  second  approach  only  considers  catastrophic  errors.  Again,  the  pri- 
mary relevance  is  to  adaptive  systems.  A catastrophic  error  may  prevent 


the  decoding  of  all  subsequently  transmitted  information. 

The  avoidance  of  catastrophic  errors  can  be  achieved  through  syn- 
chronization  procedures.  This  procedure  periodically  reinitializes  the 
transmission.  If  the  anticipated  bit  error  rate  is  low.  this  procedure  is 
perfectly  acceptable.  This  synchronization  loss  due  to  catastrophic  error 
can  only  affect  a limited  section  of  the  image.  Kxcept  for  occasional 
comments  relating  to  specific  designs,  the  remaining  portion  of  this  chapter 
ignores  the  channel  coding  problem. 

An  additional  reason  for  delegating  the  channel  error  sensitivity  prob- 
lem to  another  field  is  the  constant  research  for  more  efficient  compression 
techniques.  Since  these  techniques  will  be  the  adaptive  and  variable  rate 
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types,  it  is  unrealistic  to  process  the  output  of  these  algorithms  through 
unprotected  noisy  channels.  So,  consequently,  once  a decision  is  made  to 
utilize  a variable  rate  technique,  it  is  necessary  to  eliminate  or  at  least 
minimize  channel  errors.  Failure  to  do  so  prevents  the  utilization  of 
variable  rate  techniques. 

b.  image:  models 

1.  GENERALIZED  IMAGING  SYSTEM 

The  output  to  the  source  encoding  procedure  is  an  analog  function  for 
image  coding.  At  the  destination,  the  image  is  viewed  in  analog  form. 
Transform  coding  similar  to  other  data  compression  systems  utilizes  digital 
techniques.  Yet,  the  basic  information,  the  image,  in  its  original  form  is 
analog. 

In  general,  the  digitized  image  is  considered  as  the  original  input  to 
the  coding  procedure.  Thus,  the  original  analog  nature  of  the  image  is 
ignored.  As  it  will  be  shown  in  this  section,  the  compression -introduced  de- 
gradations nmy  be  of  tin'  same  order  of  magnitude  as  the  errors  introduced 
by  the  imaging. 

A generalized  imaging  system  is  illustrated  in  Figure  2.  It  should  be 
emphasized  that  the  imaging  system  introduces  an  additional  problem  in  the 
sense  that  it  superimposes  its  own  characteristics  on  the  source.  It  is 
stated  that,  in  the  image,  it  is  not  the  natural  redundancy  associated  with  the 
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original  scene,  but  rather  the  impact  of  the  particular  imaging  system  which 
dominates. 


18 


The  image  redundancy  is  quantified  through  its  correlation  model. 
Transform  as  well  as  other  data  compression  techniques  achieve  the  appro- 
priate bandwidth  reduction  because  the  source  (e.  g.  . image)  is  correlated. 

A reasonable  question  may  attempt  to  separate  the  correlation  properties 
between  the  scene  and  its  sampled  “quivaletit.  Reasonable  assumption  can 
be  made  about  scene  correlation.  It  can  be  easily  demonstrated  that  the 
effective  correlation  in  the  sampled  digital  image  is  to  a high  degree  induced 
by  the  very  process  which  produced  the  digital  equivalent  of  the  original 
scene.  While  this  statement  may  sound  reasonably  noncontroversial,  it  is 
fundamentally  different  for  the  two-dimensional  case,  unlike,  for  example, 
voice  compression. 

It  is  appropriate  to  review  the  steps  through  which  image  correlation 
is  introduced.  A primary  observation  is  that  all  image  forming  systems 
act  as  two-dimensional  low  pass  filters  (Goodman,  1968).  However,  these 
low  pass  filters  cannot  have  sharp  cutoffs. 

Although  the  basic  theory  has  been  available  for  some  time,  the  proper 
consideration  of  tills  low  pass  filtering  effect,  including  the  permissible 
classes  of  low  pass  filters,  has  not  been  made.  According  to  Lukosz  (1962), 

f 

the  low  pass  filter  associated  with  an  imaging  system  is  constrained  accord- 
ing to  a specific  function.  This  constraint  is  the  result  of  the  physical 
nature  of  th?  imaging  system.  The  impulse  response  of  the  appropriate 
linear  system  which  represents  the  imaging  system  is  restricted  to  be 
nonnegative. 
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1 he  sair.i-  nonnegative  bound  is  applicable  for  image  sampling. 


s.invlini;  is  pe  rtornu-il  h\  im-.isur  mg  the  local  intensity  in  the  image.  An 
.inerture  is  placed  o\er  the  appropriate  location.  I he  aperture  m that 
lov.il  region  collects  the  intensity  which  provides  the  local  sample  value. 

I ins  aperture  function  is  also  related  to  and  constrained  by  the  l.ukosc 
bound. 

Additional  dilterences  between  one  dimensional  (such  as  voice)  and 
image  compression  techniques  should  be  emphasised.  I ur  a voice  com- 
pression svstein.  the  sampling  principle  is  readilv  applicable.  1 he  appro- 
priate sharp  low  pass  niter  is  available  followed  by  the  sampling  process 
with  a narrow  aperture  function.  I he  latter  closed  approximates  the  1'itac 
delta  function.  Sinnlarlv  , tor  voice  reconst  ruction,  the  sampled  data  can 
lu-  processed  again  by  an  essentialli  perfect  low  pass  filter  resulting  within 
the  given  bandwidth  m an  essentialli  perfect  reconst  rue  t ion  ot  the  otiginal 
\ oice . 

I'or  an  image  compression  svstent,  pertect  reconstruction  is  not 
possible.  I he  primnrv  difficulty  is  the  unavailability  ot  the  required  perfect 
low  pass  niter.  1 he  perfect  low  pass  filter  implies  an  impulse  response 
which  is  analytically  the  sine  function  (doodman.  l‘>oS).  I he  sine  function 
has  negative  sidelohes  which  are  not  realizable  according  to  the  l.ukosz 
bound.  Although  cons  ule  rat  ion  of  the  imaging  system  as  related  to  an  image 
compression  cont igurat ion  is  important  from  the  philosophical  standpoint, 
significant  practical  considerations  also  apply  . Virtually  all  components 
ot  the  conversion  process  trom  tlu-  analog-to-digital  domain  and,  similatly. 
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from  the  digital-to-aualog  domain  invoh  e the  concept  of  realizability 
relating  to  low  pass  filters.  1 he  overall  effect  is  of  considerable  importance 
in  the  design  of  specific  image  compression  algorithms. 


j 

1 he  following  simple  case  serves  as  an  example.  A transmitted  image 
may  contain  an  appreciable  amount  of  high  frequency  information.  Unless 
the  display  device  is  capable  of  reconstructing  the  high  fidelity  information, 
it  should  be  transmitted.  Again  referring  to  Figure  2,  it  may  be  noted 
that  ttie  generation  ot  the  digitized  equivalent  of  the  original  scene  involves 
nonlinear  mappings,  linear  filtering,  and  noise  addition.  These  effects  are 
superimposed  and.  consequently,  become  an  integral  part  of  the  digital 
image. 

In  general,  most  coding  svstems  do  not  explicitly  consider  the  distor- 
tions which  are  introduced  by  the  image  formation  process  and  analog-to- 
digital  conversion.  I he  so-called  "eye"  model  indirectly  includes  indicated 
previous  characteristics.  I'he  simple  Roberts  (19621  technique  successfully 
incorporated  the  insensitivity  of  the  human  observer  into  a coding  scheme. 

This  technique  (similar  to  a class  of  so-called  dither  techniques)  adds  noise 
to  the  image.  Although  the  imagery  is  further  degraded,  the  elimination  of 
the  noticeable  and  unpleasant  contouring  results  in  improved  subjective 

viewing.  ] 

A transform  image  coding  system  introduces  two  types  of  degra- 
dations: a low  pass  filtering  effect  and  additive  noise  introduction. 

The  first  effect  is  peculiar  to  transform  coding.  The  second  effect 
is  common  to  most  noninformation -preserving  data  compression,  and 
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is  the  result  of  t!u-  roqu.i  nt  1 /at  ion  per  tor  med  by  the  coder.  It  the  noise 
prior  to  coding  is  eomp.ir.ihle  to  what  is  introduced  by  requant  l/.at  ion,  the 
\ isible  dec  rail.it  ion  is  acceptable.  1 iie  reader  is  urged  to  consult  the 
classical  references  on  image  formation  and  its  relevance  to  mlormation 
theon  (e.  g.  . t'ornsweet,  t'*70;  Follgett  and  1. infoot,  1 Huang.  l°71; 

Levi.  t"70L 

vini'O  muhfls 

I'he  foregoing  subsection  provides  a short  review  of  generalized  imagin 
systems  Additional  specific  considerations  apply  to  video.  1'he  generalized 
svstem  illustrated  in  Figure  d includes  video.  However,  tor  television- 
related  compression  problems,  the  quality  ot  the  input  to  the  compressor  is 
well  regulated  by  convention.  l‘he  noise  content  and  low  pass  filtering 
associated  with  video  recording  and  display  are  easily  modeled  and  are 
a\  ailable . 

Consequent  Iv , and  with  some  justification,  researchers  ot  the  video 
compression  field  usually  ignore  the  imaging  aspect  ot  video  communication. 
These  researchers  are  working  in  an  industry  well  constrained  by  conven- 
tions. anil  the  industry  is  not  likely  to  change  its  procedures  without  strong 
overriding  reasons  (Fink.  F*sM.  In  addition  to  low  pass  filtering  and  noise 
addition  in  video  recording,  color  and  the  statistics  associated  with  inter- 
frame introduce  additional  modeling  considerations.  For  the  latter  cate- 
gories. insufficient  research  lias  been  performed  to  date  trom  the  standpoint 
of  practical  implemont.it  ion.  The  problem  of  color  intertrame  coding  has 
not  been  considered  for  transform  techniques. 


Transform  coding  is  attractive  for  video  systems.  One  reason  for 
tins  is  the  relatively  low  quality  ot  video.  Here,  the  reference  to  quality  is 
made  in  the  relative  sense.  1 lie  comparison  is  with  optical  projection  or 
with  film-based  systems.  Thus,  quality  limitation  of  the  video  system 
permits  further  degradations  to  he  introduced  by  the  coder. 

The  other  singular  aspect  of  the  video  system  is  the  fairly  small  format 
image  size.  The  typical  order  of  magnitude  ot  •■■00  x ‘'00  is  not  likely  to 
change.  Consequently,  the  appropriate  memory  requirements,  such  as 
buttering,  are  relatively  straightforward.  In  contrast,  a potentially  high 
quality  transmission  system  of  still  photographs  would  involve  image  sizes 
several  orders  of  magnitude  larger  than  current  formats  for  video. 

Within  the  framework  of  a critical  overview,  some  comments  pertaining 
to  video  systems  are  appropriate.  The  general  assumption  should  not  be 
made  that  what  is  acceptable  to  qualify  for  video  is  equally  acceptable  tor 
higher  quality  display  systems.  Another  observation  relates  to  the  develop- 
ment of  new  television  cameras.  Specific  reference  is  made  to  charge- 
coupled  device  (CCD)  type  imaging,  where  the  quality  of  recorded  informa- 
tion may  significantly  improve.  Consequently,  if  v ideo  systems  improve 
with  the  availability  of  improved  equipment  such  as  solid  state  cameras, 
the  permissible  amount  of  degradation  introduced  by  the  image  compression 
system  will  have  to  be  decreased. 

3.  STATIONARY  AND  NONSTA  TIONA  RY  MODELS 

Following  the  description  of  the  physical  model  of  image  formation,  it 
is  important  to  review  the  appropriate  available  mathematical  models. 


The  common  statistical  models  are  limited,  although  they  are  useful  for 


the  analysis  of  image  compression  systems.  However,  these  models  are 
insufficient  to  achieve  maximum  compression. 

A few  general  comments  relative  to  modeling  are  appropriate  prior  to 
the  analysis  of  the  actual  mathematical  structures.  The  implementation  of 
a mathematical  model  for  an  image  compression  system  is  the  mechanism 
by  which  the  a priori  information  is  incorporated  in  the  image  coding 
process  (Jones,  1976).  By  a mathematical  model,  one  refers  to  information 
equally  available  to  both  the  transmitter  and  the  receiver.  Most  current 
mathematical  models  are  based  on  stationary  statistics  as  evidenced  by 
implementations.  In  contrast,  nonstationary  statistics  actually  imply  the 
lack  of  a useful  model. 

Generally,  modeling  refers  to  a parameterization  procedure  within  the 
coding  technique.  In  terms  of  nonstationary  modeling,  a '’learning"  pro- 
cedure can  be  utilized.  The  learning  function  introduces  a hierarchy  of 
procedures.  This  learning  procedure  may  be  based  on  the  original  source 
in  which  case  the  appropriate  model  also  must  be  transmitted.  The  model 
could  also  be  derived  from  information  already  transmitted  to  the  receiver. 
In  this  case,  the  model  generation  is  without  any  overhead. 

The  conceptual  schematic  diagram  of  this  type  of  modeling  approach 
is  shown  in  Figure  3.  An  essentially  recursive  approach  to  image  model- 
ing is  introduced.  In  subsequent  sections  this  concept  is  placed  on  a more 
concrete  basis  and  related  to  specific  algorithm  development. 
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Chi-  conventional  mathemat ical  model  for  image  coding  is  the  so-called 
Markov  model  (llal)il)i  and  Wint/..  197b).  In  this  case,  the  image  is 
descrihahle  l>y  a correlation  function  which  is  the  product  ot  exponentials. 
This  model  is  restricted  to  one  or  two  parameters  describing  the  two 
orthogonal  directions  m the  image  plane.  An  alternative  form  assumes 
rotational  symmetry  tor  the  correlation.  Despite  its  simplicity,  the  Markov 
model  has  achieved  reasonable  success  for  many  implementations.  In  tact, 
m recent  times  the  Markov  model  has  been  extended  to  the  time  variable 
interframe  coding  (Koese  and  I’ratt.  197b).  l'lie  popularity  of  the  Markov 
model  is  the  fact  that  it  lends  itself  to  convenient  analysis.  In  other  words, 
making  the  assumption  that  the  image  is  a Markov  process  permits  a certain 
amount  of  performance  evaluation. 

However,  the  simple  Markov  model  has  two  basic  limitations.  ( 1 1 It 
assumes  that  the  image  correlation  and  equivalent  power  spectrum  can  be 
well  modeled  by  the  appropriate  expressions.  lHit  why  should  a separable 
correlation  model  be  pertinent  to  an  image  ’ The  particular  symmetry  may 
be  introduced  through  the  process  of  converting  the  image  into  the  digital 
form.  (.1)  The  Markov  correlation  model  does  not  take  into  account  the 
probable  uncorrelated  component  that  is  representative  of  image  noise.  In 
fact,  the  basic  limitation  of  the  Markov  model  is  not  its  relative  inaccuracy 
but  that  it  ignores  image  noise  effects. 

The  assumption  of  the  Markov  model  validitv  is  reasonable  for  the 
original  analog  scene.  The  additional  noise  component  in  the  digitized 
image,  particularly  at  higher  spatial  frequencies,  significantly  degrades 


C onsideration  ot  noise  in  image  coding  suggests  tli.it  an  effu  ient  i oiling 


procedure  is  likely  to  utilize  a prefilter  prior  to  coding,  i >t  course,  a 
tilter  prior  to  the  compression  process  alters  the  actual  model.  llus 
author  is  ot  the  opinion  that  efficient  compression  techuupics  are  adaptive, 
t herefore,  utilization  of  the  simple  Markov  model,  while  usetu!  tor  analysis 
purposes,  represents  a basic  limitation  on  the  compression  system. 

Image  nonstat  ionar  it  y can  he  demonstrated  hy  numerous  examples. 

I wo  specific  cases  are  of  particular  interest:  t he  nonstat  lotta  r it  x within  the 
original  scene  and  the  nonstat  iona  rity  introduced  and  super  imposeit  bx  the 
imaging  process.  The  tirst  case  is  almost  obvious.  One  has  only  to  look 
at  any  scene  within  his  own  surroundings  and  simply  observe  the  variations 
in  the  local  structure.  The  second  case,  the  effect  of  imaging  process,  is 
somewhat  more  subtle.  l'he  primary  concept  here  is  that  ot  an  image 
typically  mputed  into  an  image  coding  system  as  a two-dimensional  image. 
This  two-dimensional  representation  is  generated  from  the  original  three- 
dimensional  scene. 

Kx-en  without  attempting  to  cover  image  formation  in  great  detail,  it  is 
obvious  that,  in  general,  only  part  ot  the  scene  is  sufficiently  in  focus  xxlule 
other  parts  are  out  ot  locus.  It  is  inefficient  to  represent  the  entire  scene, 
including  the  in  anil  out  of  locus  regions,  hy  the  same  statistical  model. 

Vet.  no  a priori  information  is  available  that  would  indicate  to  the  decoder 
which  parts  ot  the  scene  are  in  and  out  of  focus. 
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Ihe  imago  nonsiationarity  consideration  suggests  that,  rather  than 
modeling  individual  pixels,  one  could  model  groups  of  pixels.  One  may 
consider  a set  of  picture  elements  as  an  element  of  an  ensemble  of 
obse  rvations. 

In  image  coding  similar  to  estimation  theory,  one  utilizes  two  sets  of 
quantities:  the  random  variables  (picture  elements!  and  the  relevant 
parameters  associated  with  the  random  variables.  In  image  coding,  the 
parameterization  is  often  ignored,  l’or  a more  advanced  modeling  approach, 
a two- level  processing  is  proposed.  The  coding  of  picture  elements  is 
performed  as  before,  but  one  does  not  have  the  exact  information  about  the 
relevant  parameters.  Consequently,  a parameter  estimation  is  also 
requi red. 

Ihe  nonsiationarity  concept  through  a vector  model  is  somewhat  uncon 
ventional.  However,  a vector  model  is  natural  for  transform  coding.  In 
its  more  advanced  utilization,  parameterization  results  in  a more  efficient 
data  compression  which  also  includes  highly  nonstat ionu  ry  techniques. 

S.  IMAGE  QUALITY 

The  efficiency  of  an  image  compression  technique  can  be  quant  it  at  ivel\ 
evaluated  by  comparing  the  change  in  the  image  quality  between  the  original 
and  the  decoded  image.  Unfortunately,  image  quality  is  not  well  defined  m 
terms  of  mathematical  formalism.  I he  problem  of  quantifying  the  effects 
of  the  compression,  as  well  as  any  other  processing  step,  is  common  to 
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several  techniques  and  not  to  transform  coding  alone.  It  is  necessary  to 
consider  the  image  quality  problem;  otherwise,  the  discussed  results  are 
not  meaningful. 

Researchers  have  generally  utilized  the  mean  square  error  as  the 
qunatity  representative  of  the  image  quality.  While  mean  square  error 
techniques  have  been  used  successfully  in  many  areas  of  optimization 
problems,  the  same  evaluation  procedures  have  been  of  limited  use  in  the 
application  of  image  coding.  The  mean  square  error  is  appropriate  to  the 
analysis  of  a stationary  process.  However,  the  concept  of  nonstationarity 
reduces  the  usefulness  of  the  classical  quality  measure  in  image  coding 
application. 

The  mean  square  error  measure  has  been  modified  for  image  coding 
applications,  as  well  as  for  general  image  quality  analysis,  to  include  an 
essentially  prewhitening  filter.  The  improvement  although  real  is  limited. 
In  fact,  the  "eye"  model  in  image  coding  modifies  the  mean  square  error 
formalism  by  providing  a shaping  factor  for  the  image  power  spectral 
dens  ity. 

It  is  necessary  to  quantify  the  efficiency,  at  least  in  a relative  sense, 
of  various  transform  coding  procedures.  One  could  also  compare  the 
various  techniques  in  a common  simulation.  However,  in  most  cases  this 
is  not  practical.  The  comparison  of  different  techniques  at  a single  facility 
is  a significant  effort;  furthermore,  many  compression  techniques  have 
been  designed  for  particular  image  types. 
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However,  there  are  some  future  directions  specifically  for  image 
coding  application.  l'he  mean  square  error  quantity  is  somewhat  super- 
ficial. because  it  is  a global  measure  and  does  not  indicate  the  distribution 
of  the  error  within  the  image.  Visual  demonstration  of  the  error  distribu- 
tion (here,  the  error  is  the  pixel-by-pixel  difference  between  the  original 
and  the  decoded  image)  can  yield  a meaningful  evaluation  measure.  It  is 
reasonable  to  assume  that  the  residual  error  between  the  processed  and  the 
original  image  should  have  small  spatial  structure  and  should  be  uncorre- 
lated with  the  image. 

Consequently,  a successful  evaluation  measure,  which  also  suggests 
adaptive  techniques,  should  result  not  only  in  a small  mean  square  error 
for  a "good"  decoded  image  but  also  an  error  image  that  would  appear  as 
white  noise.  Visual  presentation  of  the  appropriate  error  image  is  indica- 
tive of  the  benefits  of  adaptive  techniques.  As  shown  in  Figure  4,  a 
nonadaptive  technique  generates  a local  error  structure  that  is  highly 
correlated  with  the  original  image. 

C . TRANSFORM  CODING 

The  basic  block  diagram  representation  of  a transform  coding-decoding 
system  is  shown  in  Figure  5.  The  coding  system  consists  of  a reformatting 
memory  followed  by  the  transformation  and,  finally,  the  actual  coding 
process.  The  receiver  is  the  mirror  image  of  decoder. 

Because  of  the  availability  of  published  material  (Tescher,  1975, 

1976.  1977;  Wintsr. , 1972),  including  tutorial  articles,  this  section  is 
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limited  to  .1  critical  review  rather  than  a comprehensive  discussion  of  all 
apsects  ot  transform  coding.  l'he  primary  subjects  addressed  are  the 
actual  transform  and  the  coding  of  the  transform  coefficients.  Unfortu- 
nately. most  of  the  research  work  has  directly  related  to  transform 
algorithms  and  not  to  what  should  be  done  once  the  t ransformation  is  accom- 
plished. In  the  opinion  of  this  author,  the  emphasis  on  the  transform 
algorithm,  essentially  at  the  expense  of  the  coding  procedures,  has  resulted 
in  a suboptimum  level  of  progress  tor  transform  coding. 

1.  Ft  ASIC  CONCEPTS 

Numerous  formal  papers  in  the  literature  demonstrate  the  principal 
advantages  of  transform  coding.  Here,  an  attempt  is  made  to  justify  trans- 
form coding  based  on  first  principle.  However,  the  mathematics  is  kept 
rather  simple. 

l'he  basic  justification  of  transform  coding  was  offered  by  Huang  and 
Schultheiss  (1963).  1'hey  demonstrated  that  the  optimum  coding  procedure 
for  a correlated  Gaussian  source  consists  of  two  steps.  Decor  relation 
results  m an  independent  source  which  is  optimally  coded  by  a memoryless 
coder.  This  basic  concept,  and  the  appropriate  mathematical  formalism, 
is  sufficient  to  introduce  the  K- L transform.  The  source  uncor relation 
produces  uncorrelated  quantities  that  form  the  input  to  the  coder.  Karly 
work  in  transform  coding  resulted  in  actual  utilization  of  the  K-l.  transform 
and  provided  impressive  results  (llabibi  and  Wintz,  19711. 
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Another  observation  is  that  the  Fourier  transform  "tends  to  approach 
the  K-L  transform  with  increasing  transform  size.  Consequently,  the 
Fourier  transform  can  successfully  approximate  the  K-L  transform  for 
reasonably  large  transforms. 

Thus,  the  basic  transform  coding  philosophy  requires  an  uncorrelating 
transform.  Furthermore,  the  Fourier  transform  is  a close  approximation 
to  the  actual  K-L  transform.  Another  important  concept  is  adaptivity. 

For  this  case,  no  general  theoretical  solution  is  available. 

Combination  of  the  decorrelation  transform  with  adaptive  coding  is  a 
powerful  method  of  data  compression.  1'he  discussion  of  specific  algorithm! 
is  deferred  to  subsequent  sections. 

2 . R F V 1 F \V  O F C LASS1C  A L V EC  1 ' N IQU ES 

It  is  appropriate  to  provide  a short  historical  review  ot  transform 
coding.  Early  research  on  transform  coding  primarily  concentrated  on  the 
transform  algorithms.  I lie  Fourier  transform  was  found  to  be  a good 
a pp  r oxima  t ion  to  the  K-L  transform.  However  , the  early  problem  with 
Fourier  techniques  involved  another  classical  behavior  of  Fourier  analysis 
which  has  been  referred  to  as  the  Gibbs  phenomenon  (Uracewell,  LUG). 

I bis  term  really  refers  to  a particular  behavior  ot  the  Fourier  transform 
as  a method  for  the  approximation  of  functions  with  discontinuities.  1 lie 
Fourier  transform,  when  used  for  approximation  (interpolation),  replaces 
the  discontinuity  with  the  average  value  of  the  neighborhood  of  the  discon- 
tinuity. The  disc  retired  implementation  of  the  Fourier  transform  creates 
difficulties  related  to  the  Gibbs  phenomenon. 


I'he  Fourier  integral  is  defined  over  t he  entire  real  axis.  However. 


the  discrete  Fourier  series  is  defined  over  a finite  interval.  The  additional 
property  of  the  Fourier  series  is  that  it  assumes  the  function  it  approxi- 
mates is  a periodic  function.  l'he  basic  period  corresponds  to  the  finite 
interval  over  which  the  Fourier  series  is  defined.  The  practical  implica- 
tion is  that  the  numerical  Fourier  transform  assumes  the  first  and  the 
last  points  of  the  basic  interval  are  neighbors.  The  Gibbs  phenomenon  is 
applicable  to  the  Fourier  series.  Thus,  the  discontinuity  between  beginning 
and  end  of  the  basic  period  is  approximated  by  the  Fourier  series  with  the 
appropriate  average  values. 

In  the  practical  sense,  the  discontinuity  problem  of  the  discrete 
Fourier  transform  in  early  applications  of  transform  coding  has  been 
significart.  I'he  Fourier  transform  approximation  when  used  over  small 
blocks  produces  undesirable  blocking  effects.  Several  attempts  have  been 
made  to  minimize  this  blocking  problem  (Anderson  and  Huang.  19711. 

In  recent  years,  it  was  realized  that  significant  improvement  can  be 
obtained  in  eliminating  the  blocking  problem  by  introducing  forced  sym- 
metry. l'he  basic  solution  is  rather  simple.  The  original  subblock  is 
replaced  by  its  symmetrized  version  as  shown  in  Figure  b.  lire  Fourier 
transform  is  applied  to  this  new  larger  subblock.  Since  only  the  even  terms 
are  nonzero,  the  number  of  nonzero  output  elements  is  identical  to  the 
number  of  input  elements.  Moreover,  this  larger  subblock  has  no  discon- 
tinuities between  the  appropriate  boundary  points.  Consequently , the 


in 


modified  block  structure  dors  not  eliminate  the  Ciibbs  Phenomenon  but 
rather  the  original  discontinuity  that  creates  the  Ciibbs  phenomenon. 

It  is  surprising  that  this  symmetrized  approach  to  Fourier  coding  only 
recently  was  discovered.  It  has  been  demonstrated  that  the  symmetrized 
Fourier  transform,  which  has  been  designated  as  the  cosine  transform  in 
the  literature,  closely  approaches  the  K-b  transform  for  many  practical 
applications  (Ahmed,  ot  al.  . 197-lb  Recent  studies  further  demonstrates 
that  the  cosine  transform  is  virtually  identical  to  the  K- L transform  for 
numerous  practical  conditions  (Jain,  1970). 

The  cosine  transform,  unlike  the  K-L  transform,  can  bo  implemented 
numerically  through  a fast  ' transform.  More  importantly , the  cosine 
transform  virtually  approaches  the  K-l.  transform  performance  with  neither 
utilization  nor  knowledge  of  the  source  correlation.  For  the  K-l.  transform 
the  source  covariance  model  must  be  available  to  derive  the  actual 
transform  matrix. 

The  cosine  transform  is  a strictly  deterministic  transform.  Con- 
versely, the  K-L  transform  is  a class  of  transformation,  and,  for  each 
application,  it  is  a function  of  the  appropriate  covariance  matrix.  It  is  a 
significant  practical  benefit  that  the  single  deterministic  transformation 
closely  approaches  in  performance  the  entire  class  of  theoretical  optimum 
transformation. 

The  considerable  amount  of  past  research  effort  involving  the  various 
types  of  transforms  appears  to  be  of  somewhat  questionable  value.  Farly 
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techniques  included  utilization  of  several  transforms  including  the  slant, 
Hadamard,  and  a large  class  of  transforms  which  can  be  derived  from  the 
first  two  transforms  (Dail,  1976;  Pratt,  etal.  , 1974;  Andrews,  1975). 


'I 


The  transform  research  may  be  summarized  in  two  conclusions:  (1)  No 
deterministic  transform  has  equaled  the  performance  of  the  cosine  trans- 
form, and  (2)  no  theoretical  justification  was  offered  in  the  first  place  as  to 
why  the  "other'1  transforms  should  be  beneficial. 

When  this  author  attempted,  at  an  earlier  date,  to  produce  a justifica- 
. tion  for  the  Walsh  functions,  the  only  reason  he  could  find  was  that  these 

i 

functions  were  similar  to  the  Fourier  basic  functions  (Tescher,  1973). 

Thus,  the  Hadamard/ Walsh  techniques  succeed  as  essentially  ad  hoc 
approximations  to  the  Fourier /cosine  techniques.  A further  fundamental 
observation  is  that,  unlike  the  Fourier  transform,  the  various  other  trans- 
forms will  not  asymptotically  approach  the  K- L transform  performance  with 
increasing  transform  sizes. 

Two  other  considerations  of  transform  algorithms  are  the  appropriate 
computability  and  the  size.  Under  computability,  the  complexity  of  the 
potential  device  for  implementation  of  the  transform  is  defined.  Fortu- 
nately, other  than  the  general  K- L transform,  most  of  the  useful  transfor- 
mations are  implementable  via  fast  algorithms  (Cooley  and  Tukey,  1965). 

The  suboptimal  Hadamard  and  Haar  can  be  implemented  without  multiplica- 
tion. For  specialized  applications,  a computationally  more  efficient  trans- 
form, even  at  the  expense  of  performance  loss,  may  be  preferred.  For 
example,  a specific  tradeoff  may  involve  the  Haar  transform  which  has  a 
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fast  algorithm  without  any  multiplication.  On  the  other  hand,  various 
hybrid  implementations  for  the  cosine  transform,  e.  g.  , through  C CD  tech- 
nology, minimize  the  advantages  of  the  suboptimal  transform  ( \V hitehouse , 
et  al.  , 1 975). 

Transform  size  is  an  important  practical  consideration.  1’he  argument 
is  often  made  that.no  benefit  is  obtained  by  choosing  transforms  larger  than 
the  image  correlation  distance,  assuming  the  latter  quantity  is  available. 

This  approach  is  artificial  and  ignores  the  tact  that  the  transform  uncorre- 
lates only  the  pixels  within  the  subblock.  It  will  not  uncorrelate  the  pixels 
among  subblocks.  A typical  subblock  consists  of  8x8  or  16  X 1 1>  pixels. 

The  appropriate  reasoning  is  that  the  image  correlation  is  not  likely  to 
exceed  8 or  16  pixels,  respectively. 

Although  this  reasoning  has  not  been  challenged,  it  is  not  valid.  Even 
if  all  pixels  within  the  transform  block  become  decorrelated  via  the  trans- 
formation, the  pixels  on  the  border  remain  correlated  with  respect  to 
pixels  on  the  borders  of  adjacent  subblocks.  Consequently,  the  argument 
against  using  large  subblocks  to  achieve  improved  image  decorrelation 
is  not  proper  as  it  is  based  on  the  correlation  distance.  Recent  work 
with  relatively  large  size  (256  X 250)  transforms  demonstrated 
signilicant  performance  gains  over  small  size  implementations  (l’escher, 
1973;  Tescher  and  Andrews,  1979).  However,  tor  practical  reasons  it  is 
advantageous  not  to  exceed  16  X 16  or  12  x 12  size. 

An  additional  argument  against  the  larger  size  is  the  concept  of 
adaptivity.  It  is  still  an  unresolved  question  how  to  optimize  the  transform 
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i H\.sidtT.itum  dependent  on  local  image  slruiturc.  I'lifso  lim  unia-pts  are 
cont  radictorv  ami  the  solution  is  not  at  all  obv  ious.  lo  obtain  maximum 
ilocor  m latiou  . iiutimsiiii;  tho  sim  is  beneficial.  s om  rrsrlv.  to  adapt  to  (lie 
liu  a l 1 111a  gc  st  riK-turi' . a sniallrr  b l o c k s l r is  proto  rrril, 

A It  hough  it  was  oa  rlior  a r jjuoil  b\  \\  i nt  z 1 1 d < .’  I that  utilizing  s ma  I lo  r 
blocks  permits  implemcntat  mu  ot  adaptive  models,  Ins  suggest  ions  bavo 
not  boon  followed.  l'lu*  ounoopt  ot  adaptivity  favors  small  subblocks.  the 
same  i oiuopt  int  roduoos  son  s iili*  rat  ion  ot  overhead  inlornut  ion.  In  general, 
the  ovorlioail  as soc iatod  with  an  adaptive  transform  algorithm  is  likely  to 
booiim'  mom  important  with  do  rousing  transform  block  si/rs. 

In  siimmarv.  thro  o primary  eons  ido  rat  ions  relate  to  translorm  si/o. 

1 a r re  r s i z o s minimize  block  to  blook  c o r r s'  la  t io  n . Ada  pt  i v e p r oo  e d u r o s 
are  likely  to  favor  smaller  sizes.  f inally,  the  overhead  information  per 
subhloek  should  not  exceed  a reasonable  traction  ot  the  available  bandwidth. 

Another  potential  study  relates  to  the  (spatial)  shift  variance  impact  ot 
tin1  transform.  In  general,  t ranstorin  coding  is  a shitt  v ariant  procedure. 

I he  degree  ot  shift  variance  is  a function  ot  the  transform  size  as  well  as  the 
local  image  origin.  Cons  ide  ration  ot  the  shift  variance  problem  has  been 
generally  ignored. 

An  example  of  a simple  shift  variant  filtering  is  the  removal  ot  the 
v isually  undesirable  blocking  introduced  at  extreme  compression  rates.  I he 
required  filter  will  only  smooth  block  boundaries  without  much  proi  essmg 
vv  ithm  hhu  ks  . 
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I he  general  effect  of  quant  izat  ton  of  tin-  soura'  is  .1  primary 
cons ide ration  for  all  source  coding  procedures.  Quantization  is  the  non- 
invertible  mapping  from  the  analog  source  to  its  discretized  equivalent. 

Thus,  the  originally  continuous  parameters  will  he  represented  by  integers. 

In  general,  the  quantizer  is  the  major  error  source  in  image  coding.  In 
most  cases,  quantization  error  is  the  only  distortion  present  in  the  decoded 
image.  All  other  error  sources,  such  as  numerical  roundoff,  are  negligible 
in  most  instances.  It  not,  they  can  be  minimized  or  even  eliminated  by 
increasing  register  sizes  and  by  using  integer  arithmetic.  However,  the 
quantization  is  a fundamental  distortion  and  in  most  cases  it  is  unavoidable. 

As  shown  in  l'igure  7,  the  quantizer  is  a mapping  from  the  continuous 
variable  domain  of  transform  coefficients  into  the  domain  of  integers. 

1'hese  integers  become  the  code  words  that  are  transmitted  through  the 
channel  (O'Neal.  1971).  An  alternative  superior  approach  is  a two-phase 
coding  technique  in  which  the  integers  are  the  secondary  input  into  an 
entropy  coding  processor.  The  output  of  this  coder  generates  the  final  code 
words  to  be  transmitted  through  the  channel.  The  two-phase  coding  approach 
except  for  Tasto  and  Wintz  (1971),  has  not  been  pursued.  Practical  diffi- 
culty with  entropy  coding  is  that  it  is  an  open  loop  procedure. 

The  quantizer  output  is  an  integer  which  is  also  the  code  word  for  one- 
step  coding.  The  relevant  optimization  procedure  is  the  minimization  ot 
mean  square  error  between  original  and  quantized  coefficients.  Since  all 
practical  transform  coding  systems  utilize  unitary  transforms,  the  mean 
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square  error  is  preserved  under  the  transformation.  I'hus,  quantizer 
optimization  in  the  transform  domain  guarantees  optimum  spatial  domain 
performance  in  the  minimum  mean  square  error  sense. 

For  a given  transform  domain  model,  the  quantization  parameters  are 
easily  obtained  via  the  Max  (I960)  quantizer  algorithm.  Knowledge  of  the 
probability  density  function  ot  the  random  variable  tor  a specified  number 
of  quantization  bins  permits  computation  of  the  required  thresholds  and 
reconstruction  levels  through  the  Max  algorithm  or  approximations 
(Panther  and  l)ite,  PP'l). 

I'he  actual  coefficient  quantization  is  performed  in  two  steps:  (1)  l he 
coefficient  is  normalized  by  its  estimated  variance,  and  (.’)  the  normalized 
variable  is  processed  by  the  optimum  quantizer  based  on  the  modeled 
probability  density  function  of  unit  variance.  l he  number  of  bits  for  a 
quantized  coefficient  is  determined  by  relating  the  assumed  prequantized 
variance  to  distortion.  The  result  can  be  derived  through  several  different 
ways  including  formal  rate  distortion  theory  (llabibi,  t'liMi).  1 o maintain 
equal  distortion  for  each  quantized  coefticient,  the  required  number  of 
quantization  bins  should  be  proportional  to  the  standard  deviation  of  the 
quantity  to  he  quantized.  Thus,  the  appropriate  bit  assignment  procedure 
follows  directly  from  this  argument.  The  coefficients  are  generally  assumed 
to  be  Gaussian  random  variables.  In  recent  experiments,  the  exponential 
density  function  has  been  found  more  appropriate  (Teseher  and  Cox,  PCo). 
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[t  should  be  noted  that  the  Max  quantizer  minimizes  the  global  error 
between  input  and  output,  usually  mean  square  error.  A more  difficult 
problem  is  accurate  determination  of  the  prequanti/.er  normal  ization  factor. 
Phis  problem  is  tin*  determination  of  the  transform  domain  model.  I'he  basic 
question  is  how  to  normalize  each  of  the  transform  coefficients  prior  to 
quantization. 

Most  investigators  assume  the  Markov  correlation  model  for  the  image 
from  which  the  generalized  power  spectral  density  is  derived.  Ihis  model 
is  convenient  and  only  requires  one  or  two  parameters.  1 or  the  latter  case, 
the  image  correlation  in  orthogonal  directions  is  assumed  to  be  different. 

I’he  required  normalization  factors  also  can  be  obtained  directly  from  a 
class  of  images  through  a learning  procedure  in  which  the  appropriate  trans- 
form coefficients  of  one  or  several  images  are  averaged  m the  root  mean 
square  sense.  This  approach  is  found  to  be  superior  to  the  Markov  model 
and  it  produces  the  required  normalization  parameters  directly. 

A disadvantage  of  the  second  approach  is  that  transmission  of  the 
normalization  parameters  may  significantly  tax  the  available  bandwidth.  I or 
a separate  training  set.  a mismatch  may  develop  between  the  training  set 
and  the  image  to  he  coded.  1 he  logistics  of  determining  the  normalization 
parameters  for  different  image  classes  may  become  substantial. 

An  alternative  technique  by  Teacher  (I97t)  requires  no  a prion 
development  ot  normalization  parameters.  I'he  necessary  parameters  are 
obtained  recursively  from  previously  quantized  transform  coefficients  and 
may  be  determined  in  real  time.  Although  the  early  work  was  promising. 
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it  lias  not  been  pursued  by  other  researchers  primarily  because  of  its 
complexity.  The  implementation  was  over  large  transforms.  It  was  also 
an  open  loop  procedure;  thus,  the  actual  compression  rate  could  not  be 
specified  in  advance.  These  early  problems  have  been  resolved  and  the 
procedure  is  discussed  in  Section  C . -I. 

Since  the  recursive  model  is  attractive  for  image  compression,  its 
theoretical  advantages  should  be  discussed.  The  previous  two  quantization 
models  (the  Markov  and  the  training  set  model)  are  utilized  in  deterministic 
fashion.  In  contrast,  the  recursive  model  considers  the  normalization 
parameters  to  be  a set  of  highly  correlated  random  variables.  The  recur- 
sive model  generates  the  normalization  parameters  from  the  already 
decoded  values.  Thus,  the  recursive  approach  to  transform  image  coding 
allows  an  improved  utilization  of  the  transform  domain  without  any  overhead 
and  prior  model  assumptions. 

Several  additional  considerations  should  be  mentioned.  Clearly,  the 
mathematical  complexity  has  increased.  Both  the  estimation  procedure  and 
the  determination  of  the  number  of  bits  for  each  coefficient  require  real- 
time computation.  The  same  hardware  must  also  be  included  in  the  decoder 
since  it  performs  the  same  recursive  computation.  The  results  of  the 
computation  in  the  decoder  must  be  identical  to  that  ot  the  coder;  otherwise, 
synchronization  loss  may  develop.  This  requirement  implies  not  only 
performance  of  the  identical  estimator  procedures  but  also  implementation 
of  the  appropriate  algebraic  steps  in  the  identical  order  accuracy. 
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I he  potential  problem  is  that  the  roundoff  error  may  produce  different  bit 


assignments  between  the  coder  and  decoder.  This  technique,  like  all 
variable  rate  adaptive  techniques,  is  sensitive  to  channel  errors.  Ihus, 
potential  propagation  of  catastrophic  errors  must  be  avoided  by  periodic 
resynch  r on  i zat  ion. 

I hus  far  in  the  discussion,  no  attempt  has  been  made  to  specify  the 
transformation.  However,  one  could  argue  that  the  quantization  model  and 
its  subsequent  utilization  are  at  least  as  important  as  the  transform  itselt. 
lhe  various  quantization  models  arc  equally  applicable  to  all  t ranstormations. 

I'he  importance  of  the  proper  quantization  procedure  cannot  be  over- 
emphasized. lhe  cosine  transform  decorrelates  most  practical  types  ol 
imagery.  both  theo ret ic ally  and  practically,  the  oisine  transform  closely 
approximates  the  K 1.  transform  performance  tor  reasonable  block  size's  ot 
S v S pixels  or  greater.  Thus,  little,  if  any,  additional  gain  can  be-  obtained 
by  the  optimization  of  the'  transform  itself.  lhe  real  gain  is  in  tlu*  moeli'ling 
and  the  appropriate-  quantization  strate'gv  (Renele'r.  19T">;  Ue'is.  e*  t a 1 . . t ' > 7 of 
Knt  ropy  ending  and  or  e.irefully  de-sigiu-d  quantization  slrute-gv  are  more 
important  than  e»pt  imiz.ut  ion  of  transforms  (Huffman.  \ pr.ntie.il 

difficulty  with  conventional  e-nt  ropy  coding  is  that  it  eloe's  iu>t  guarantee  a 
spec  if  ie  rate.  It  max  not  own  achieve  bandwidth  compression.  I'he'  ge'iu-ral 
ized  adaptive'  coiling  strategy  is  also  applicable  to  entropy  ending,  (.'omen 
tional  e'pe'n  loop  e'litropy  coding  can  be'  conve' rte'd  into  a e loseel  le'op  syste'm 
through  the  ge-no ral i zed  adaptive-  meidel. 


For  completeness,  the  so-called  threshold  coding  should  he  mentioned 
(Anderson  and  Huang,  1971).  Here,  coefficients  are  quantized  which  exceed 
a fixed  threshold.  Threshold  coding  utilizes  all  the  undesirable  features  of 


the  various  quantization  models.  It  is  open  loop,  highly  error  sensitive. 

Its  coefficient  modeling  is  poor  since  each  coefficient  is  processed  by  the 
same  quantizer  if  it  is  above  the  threshold.  Specification  of  the  coefficients 
to  be  transmitted  requires  considerable  overhead.  Threshold  coding 
approximates  entropy  coding  in  complexity  without  benefit  of  the  latter 
technique. 

3.  I MACK  QUALITY  DEGRADATION 

For  image  coding,  in  general,  and  transform  coding,  in  particular, 
image  quality  measures  which  quantitatively  correlate  well  with  the  perceived 
image  quality  and  degradation  as  introduced  by  the  coder,  ate  unavailable. 
This  is  true  even  though  conventional  measures,  such  as  the  mean  square 
error  and  its  derivatives  (e.  g.  . the  weighted  mean  square  error),  are 
helpful  for  comparison  purposes.  However,  these  measures  are  limited, 
in  an  absolute  sense,  for  comparison  between  the  original  and  processed 
images.  Even  without  useful  quantitative  measure,  degradations  associated 
with  transform  coding  can  be  discussed. 

Three  classes  of  degradations  can  be  identified.  (1)  The  coefficients 
are  replaced  by  their  quantized  equivalent.  (.!)  Some  of  the  coefficients  are 
replaced  by  zero.  In  effect,  a great  deal  of  the  bandwidth  compression  is 
tlie  result  of  this  low  pass  filtering.  (3)  The  transform  coding  procedure  is 
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not  s hilt  inva  r la nt . 


1 ho  processing  is  performed  over  adjacent  blocks. 


[t  a new  global  origin  is  chosen  for  the  process,  the  results  are  different. 

Visually,  these  effects  can  be  separated  in  most  instances.  Coeffi- 
cient quant i cation  introduces  apparent  additional  image  noise.  Replacement 
of  coefficients  by  zero  in  the  decoder  is  a low  pass  filtering  effect.  It 
observable,  the  image  will  appear  exactly  as  such.  1 he  decoded  image 
compared  with  the  original  image  may  loose  some  ot  its  details.  Low  pass 
filtering  is  rather  noticeable  at  low  data  rates.  Although  this  effect  is 
transform  dependent  even  for  the  cosine  transform  at  low  rates,  the  visual 
blocking  at  the  transform  boundaries  is  clearly  undesirable.  I he  term 
"low  bit  rate  indicates  that  only  a small  fraction  of  coefficients,  say  10'. , 
will  be  transmitted.  The  same  problem  is  further  magnified  with  suboptimul 
t ransfo  rms . 

For  the  Fourier  transform,  the  Gibbs  phenomenon  is  discussed  earlier 
in  the  chapter.  The  cosine  transform  is  less  sensitive  to  this  blocking  effect. 
However,  in  the  limit,  it  also  introduces  blocking. 

Although  reliable  mathematical  measures  are  not  available,  techniques 
that  indicate  image  quality  degradation  in  the  decoded  image  compared  with 
the  original  are  available.  1’he  "error  image"  is  usually  helpful.  The 
absolute  value  of  the  difference  between  the  original  image  and  the  decoded 
image  defines  this  quantity.  For  display  purposes,  the  error  image  is 
usually  scaled  by  a factor  of  from  10  to  40.  If  the  information  loss  is  small, 
the  information  present  in  the  error  image  should  also  be  small.  For  a 
high  quality  coding  system,  the  error  image  will  appear  to  be  white  noise. 
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4. 

In  most  cases,  the  one  who  implements  and  the  one  who  designs  an 
image  compression  system  utilize  the  output  of  some  imaging  device  without 
any  interaction  or  consideration  of  its  original  design.  Historically,  this 
procedure  lias  been  common.  Television,  as  an  example,  was  designed 
primarily  for  a particular  image  quality  operation.  The  developer  of  the 
video  compression  algorithm  cannot  interact  with  the  design  of  the  television 
system.  In  principle,  one  would  desire  an  interaction  between  the  two. 

Transform  coding  algorithm  achieves  its  bandwidth  reduction,  in  part, 
by  not  transmitting  the  entire  transform  plane.  Thus,  a nonadaptive  trans- 
form algorithm  particularly  at  low  rates,  acts  as  a low  pass  filter.  A 
general  imaging  system  also  acts  as  a low  pass  filter.  Thus,  the  required 
low  pass  filtering  of  the  compression  procedure  could  be  achieved  equally 
well  in  the  analog  domain.  If  the  low  pass  filtering  is  performed  by  the 
imaging  device,  the  required  sampling  rate  also  could  be  reduced.  The 
equivalent  rate  reduction  is  achieved  by  reducing  the  number  of  picture 
elements  representing  the  original  analog  image  source. 

The  analog  low  pass  filtering  is  not  equivalent  to  low  pass  filtering 
performed  by  the  transform  algorithm.  Unlike  the  transform  coding 
algorithm,  the  imaging  device  is  approximately  a spatially  invariant  filter. 

These  comments  represent  an  oversimplification.  In  principle,  the 
algorithm  input  should  be  projected  back  into  the  analog  domain.  The 
digital  image  format  is  an  intermediate  step.  Consequently,  various 
aspects  of  the  transform  coding  algorithms  could  be  performed  in  the 


imaging  device  itself.  Similarly . if  the  imaging  device  as  a low  pass  tiltei 
is  mismatched  to  the  sampling  procedure,  e.  g.  . in  the  case  >f  oversampling, 
tlu-  apparent  successful  bandwidth  compression  is  the  result  of  low  quality 
image  generation  in  the  digital  domain  and  not  necessarily  evidence  of 
efficient  coding. 

S.  LUNDAMLNTAL  LIMITATIONS  OI-  N ON  ADA  1 ’T  IV! 

TRANSFORM  COIHNC, 

Various  degradations  associated  with  transform  coding  are  described 
in  previous  sections,  l or  a nonudaptivc  technique  with  a specified  rate,  the 
degradations  are  essentially  unavoidable.  Other  than  changing  the  bit  rate, 
these  degradations  are  deterministic.  A nonadaptive  algorithm  is  designed 
to  be  a fixed  coding  algorithm  operating  identically  for  all  images  and  all 
image  regions. 

g he  question  is:  What  is  optimality  ' Lor  a nonadaptive  technique,  the 
image  to  be  coded  is  assumed  to  be  a stationary  source.  Were  the  sta- 
tionarit\  assumption  valid,  a nonadaptive  coder  could  be  an  optimal  image 
coder.  In  general,  the  stationaritv  assumption  is  not  a good  one. 

Images  usually  have  different  statistical  structures,  both  from  image 
to  image  as  w ell  as  w ithin  an  image.  Image  nonstationa  r its  results  not  onl\ 
from  scene  nonstationa  ritv  but  also  from  the  imaging  process  itselt.  An 
imaging  device  projects  a three-dimensional  scene  onto  a two-dimensional 
plane.  Consequently,  those  parts  of  the  image  in  focus  require  high 
fidelity  and.  equivalently,  a substantial  portion  of  the  available  bandwidth. 


Those  segments  that  are  out  of  focus  (say,  the  background)  appear  to  be 
blurred,  thus  requiring  only  a small  fraction  of  the  bandwidth. 

This  qualitative  discussion  suggests  that  the  stationarity  model  for 
image  coding  might  be  a severe  design  limitation.  In  his  review, 

Wintz  (1972)  indicated  that  nonadaptive  transform  coding  is  of  little,  if  any, 
benefit  over  other  conventional  but  simpler  image  coding  techniques 
(Habibi,  1971).  The  real  gain  is  in  adaptivity.  Unfortunately,  research 
advances  obtained  to  date  are  limited. 

In  the  next  section,  primary  concentration  is  on  nonstationary  models, 
algorithm  implementations,  and  relevant  concepts. 
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! i . ADAPTIVITY  CONCEPTS 

Several  years  ago,  W intz  (1972)  urged  the  implementation  of  adaptive 
t ech nique  s . In  this  review,  essentially  a halt  decade  later,  one  would  be 
pleased  to  discuss  the  numerous  algorithms  and  implementations  dealing 
with  adaptivity.  Unfortunately,  this  situation  has  not  developed.  Most  tech- 
niques have  followed  primarily  nonadaptive  procedures. 

The  original  suggestions  for  adaptivity  in  transform  coding  are  valid 
today.  In  this  section,  the  principles  of  adaptivity  are  reviewed.  Further- 
more, various  implementation  methods  are  discussed.  Unfortunately, 
the  available  literature  on  which  to  base  this  analysis  is  rather  limited,  and 
the  following  discussion  is  heavily  based  on  the  author  s own  research. 

1.  NONSTATIONARY  IMAGE  MODEL  CONSIDERATIONS 

Here,  the  assumption  is  made  that  imagery  as  a source  should  be  mod- 
eled as  a nonstationary  representation.  Consequently,  coding  procedures, 
which  m a large  extent  freeze  the  algorithm,  are  not  appropriate.  In  the 
following  discussions,  the  image  coding  model  is  part  of  the  information  to 
be  utilized  bv  the  decoder.  This  fact  does  not  necessarily  indicate  that  new 
or  additional  modeling  information  must  be  transmitted  to  the  decoder.  For 
an  efficient  system,  the-  modeling  information  can  and  should  be-  derivable 
from  previously  decoded  information. 

The  basic  assumption  is  made  that  the  image  coding  procedure  is  the 
transmission  of  fluctuation  information  about  a preassigned  model.  However, 
this  model  is  part  of  the  information  to  be  transmitted.  Ideally,  the  model 
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would  permit  local  changes  through  parameterization.  More  importantly, 
it  would  assist  in  meaningful  redistribution  of  the  available  bandwidth  among 
partitions  of  the  image. 

However,  before  proceeding  with  various  adaptive  algorithms,  it  is 
necessary  to  review  the  impact  of  a realistic  communication  system  model 
as  a const  raint . 

1.  COMMUNICATION  MODKL  AS  A PRACTICAL  CONSTRAINT 

Although  image  coding  is  applicable  for  non  real-time  applications  such 
as  storing  pictorial  information  on  magnetic  tape,  the  primary  application 
is  real-time  image  transmission.  Thus,  the  image  coder  must  be  consis- 
tent with  the  constraints  of  a realistic  communication  model.  The  appro- 
priate limitation  is  straightforward. 

With  increasing  adaptivity,  the  coder  output  will  fluctuate  in  rate  if 
the  adaptivity  permits  a variable  rate  compressor.  The  relevant  problem 
is  how  to  interface  the  variable  rate  compressor  with  the  fixed  rate  channel. 
For  a variable  rate  compressor,  communications  model  constraints  become 
a necessary  consideration. 

In  principle,  the  solution  is  simple.  The  compressor  (source  encoder) 
must  be  interfaced  with  the  communication  channel  through  a rate  equalizing 
buffer.  This  buffer  permits  the  deviation  in  bits  from  the  average  rate  as 
required.  However,  the  practical  solution  is  somewhat  involved. 

Another  practical  consideration  of  a communication  channel  relates  to 
channel  errors.  The  various  applicable  channel  coding  techniques  can  be 
classified  into  two  groups:  In  one  case,  through  algebraic  coding,  channel 
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imtoi'h  ,i  ft*  e s sent  i.t  1 1 y t*  1 i m mat  ed . In  1 lu*  other  i.isc,  .i  ( i\ed,  iionnegliglblc 
rati*  of  ilumu'l  errors,  including  catastrophic  errors,  is  permitted.  1 low  - 
ever,  periodic  r e init  in  l i / at  ion  ot  the  algorithm  is  performed. 

In  general,  it  is  difficult  to  evaluate  the  impact  ot  channel  errors  tor 
adaptive  procedures  . Krrotieous  hits  ma\  be  s igni  t icanl  I v important,  parti 
eular l\  for  overhead  intor mation.  On  the  other  hand,  erroneous  hits  that 
re  suit  onl  v in  itnorrei  t reoonst  ruction  ot  a single  I ratistorm  i oeltu  ient  may 
result  m relatively  minor  image  degradation  t 'otisei|uenl  1 \ . the  primar\ 
impact  ot  .1  communication  svsletn  is  the  need  to  interlace  the  inherent  l\ 
fixed  rate  transmission  system  with  a localh  tluctuating  rate  source  en 
coiling  system.  The  appropriate  problem  is  to  design  a global  lixeil  rate 
system  that  is  highly  rate  adaptive  locally  . 

>.  I 'i  IM  S OT-  AP  \l'l  l\  1 IO 

\ d a nt  iy  it  y procedures  can  he  classitu'il  min  two  hri'ad  c a t e go  r i e s . hoi 
the  first,  the  source  coder  rate  is  constant.  However,  various  other  coder 
parameters  are  c hanging.  For  the  second,  in  addition  to  other  parameters, 
the  local  c iiinpr  e s s ion  rate  is  also  variable  1 he  see  one!  ada  pt  iv  i t y category 
involves  i mp!  e me  nt  a I io  n d i f I i v u 1 1 v he  c an  si*  o I the  varying  rate. 

In  addition  to  c la  s s it  ic  at  t on  according  to  rate,  other  cons  ider.it  ions  an 
appropr  late . For  a highly  adaptive  system,  the  degree  ot  adaptivity  or  the 
number  of  diflerent  yv.ivs  the  voder  c an  operate  may  be  either  a large  mini 
her  or  essentially  a continuous  parameter.  In  this  case,  the*  overhead  in 
formation  needed  by  the  decoder  to  determine  the  operation  modi*  should  be 
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doeodable  from  previously  decoded  data.  Otherwise,  the  transmitted  over 
head  would  require  a greater  fraction  of  the  available  bandwidth. 

A somewhat  simplified  adaptivity  procedure  may  allow  a few  operational 
modes,  say  four,  and  within  each  class  the  appropriate  coding  algorithm 
may  be  considered  to  be  a nonadaptive  procedure.  This  type  of  approach  has 
several  advantages  despite  its  limitations.  Basically,  for  a small  number  of 
algorithm  modes,  one  designs  several  independent  coding  algorithms  corre 
spending  to  the  different  classes.  Thus,  other  than  determining  within  which 
class  the  algorithm  currently  operates,  the  decoder  is  essentially  nonadaptive. 

The  appropriate  modeling  is  similar  to  a Markov  chain.  For  each  sub- 
block, the  source  corresponds  to  one  of  the  states  of  the  Markov  chain,  and 
the  coder  processes  the  data  according  to  the  same  state  classification.  For 
the  four -class  example,  two  bits  must  be  allocated  for  overhead  per  subblock. 
Consequently,  the  available  bandwidth  is  primarily  utilized  tor  the  trans 
mission  of  transform  coefficients  for  any  reasonable  size  transform  block, 
say  8X8  or  larger. 

Before  proceeding,  a short  overview  of  a specific  transform  coding 
system  that  has  been  developed  is  in  order.  A system  built  at  NASA  by 
Knauer  ( 1 B ) and  his  coworkers  is  the  three-dimensional  implementation 
of  the  relatively  simple  algorithm  originally  proposed  b\  Landau  and  Slepian 
( 1 '*7  I ).  The  NASA  model  is  an  adaptive  interframe  coder  utilizing  the 
■1  \ -I  x I (fadamard  matrices.  The  system  operates  at  a constant  rate.  The 
appropriate  normalization  model  (i.e.,  the  quantizer!  varies  according  to 
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loc  al  ac t i vi  l v . 


Ihe  impU‘iiu‘iitat  ion  is  straightforward  since  t!u’  rato  is  tixed 
A frame  storage  is  required  for  four  framrs.  I In-  appropriate  transform  is 
three  dimensional  lladamard.  Two-bit  overhead  indicates  the  classification 
intor  mat  ion  to  the  decoder.  1 lit*  system  operates  at  one  hit  per  pixel  with 
excellent  imago  quality  compared  with  the  original  video  image.  Ihe  actual 
implementation  of  the  algorithm  has  been  pertormed  at  regular  I \ rates. 

In  Figure  S.  a schematic  description  ot  tins  algorithm  is  shown.  I he 
mot  i vat  ion  to  assign  different  classes  is  based  on  temporal  aetivit\  in  the 
image.  Thus,  the  amount  of  image  motion  within  four  frames  determines 
the  allocation  of  the  available  bits  in  tin'  temporal  direction.  Since  the  over- 
all rate  is  fixed,  the  total  number  ol  bits  for  each  4 x 4 X 4 transform  block 
is  constant. 

I he  discussed  algorithm  at  one  bit  per  pixel  is  probably  close  to  the 
lowest  rati-  at  which  the  system  could  operate  with  acceptable  image  quality  . 
•r-1  i_  . . . ..  ■ ...  I .1  : : .,..1,  o,  ......  ,1 1 o 1 I t t- -in  e for  t , > lilin  I 

l lie  ucis  u r*  y ft  ivii  t luimauutt  10  v v i . »<•»»-'  ■ • 

utilizes  an  identical  fraction  of  the  available  bandwidth.  For  an  intertrame 
coding  problem,  the  local  variability,  both  spatially  and  temporally,  is  likely 
to  be  significant.  The  current  NASA  concept  is  the  only  existing  real- 
time  transform  coding  system  either  for  int  raframe  or  intertrame  coding. 

The  NASA  design  incorporates  several  diagnostic  modes,  which  also 
serve  an  educational  purpose  by  demonstrating  various  motion  types  within 
an  image.  It  is  also  helpful  to  design  quantizer  parameters  associated  with 
various  classes.  Other  than  the  requirement  for  the  four  frame  memory. 


the  system  design  is  straightforward.  Implementation  through  the  lladamard 
matrices  eliminates  multiplications. 
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\ a rial) U*  rati*  source  coding  techniques  require  rate  equalizer  buffers. 

I his  requirement  is  ease  to  demonstrate.  Since  the  communication  channel 
operates  at  a fixed  rate,  the  source  coder  rate  fluctuation  must  be  absorbed 
b\  the  additional  buffer.  1 he  buttering  design  is  important. 

I he  general  model  of  a variable  rate  system  is  shown  in  Figure  9. 

I he  rate  equalizing  buffer  is  implemented  between  the  source  coder  and  the 
c hannel.  I'he  important  consideration  is  how  the  source  coding  system 
controls  gross  parameters  ot  the  decoder.  In  a realistic  system,  the  source 
coder  cannot  operate  at  variable  rate's  without  some  externally  controlled 
mechanism  . An  open  loop  source  coding  system  is  not  acc  ept  able* . 1 his 

statement  is  consistent  with  tlu*  assumed  philosophy  ot  nonst ationar it v . hve 
for  a variable  rate  system,  image  nonstat ionarit \ may  be  more*  extreme  thai 
what  the  rate  equalizer  buffer  could  allow  without  an  additional  controlling 
mechanism. 

For  a transform  coding  procedure  (f  igure  "d  a 'prebutter  is  required 
This  buffer  performs  the  image  reformatting  into  subblocks.  For  a variable 
rate  system,  the*  additional  buffer  is  required  to  accommodate  tlu*  source' 
coder  fluctuations.  Fbe  basic  problem  is  te>  determine  how  te")  control  tin* 
overall  coding  parameters  in  terms  of  utilization  ot  tin*  various  butters.  I m 
general  solutions  are*  available. 

I he  tirst  approach  uses  the  reformatting  butter  t e>  centred  the  coding 
parameters  . ! he  secetnd  approach  uses  the  rate*  equalizer  butter  in  a tcoel 

back  loop.  I he  two  techniques  arc  tie  > t equivalent.  I be*  first  technique  is 


logically  i‘asv  to  understand;  however.  it  is  loss  efficient.  1!>  ro.  tho  rclor 
matting  buffer  serves  as  a control  mechanism.  Although  general  designs  ar< 
deterred  to  the  Section  1).  5 , the  principle  of  butler  control  is  introduced 
he  re . 

One  assumes  that  the  adaptivity  procedure  does  not  refer  to  the  entire 
image  but  is  restricted  to  only  that  portion  of  the  image  which  resides,  at 
any  one  time,  in  the  reformatting  buffer.  Consequ  ntly,  each  image  segment 
corresponding  to  tho  prebuffer  is  transmitted  at  a f.xed  rate  eor  res  ponding 
to  the  required  channel  rate. 

Larger  reformatting  buffers  result  in  more  adaptive  systems.  For  con- 
ventional raster  type  imago  structure,  the  transform  blocks  are  nX  n 
subblocks.  Thus,  the  minimum  prebuffer  size  is  n lines.  For  a ^12  x M2 
image,  with  16  \ In  transform  subblocks,  the  adaptivity  is  implemented 
over  32  subblocks.  The  result  is  a significant  redistribution  of  the  available 
bandwidth  over  a reasonable  image  size.  If  each  image  segment  correspond- 
ing to  the  prebuffer  is  coded  at  the  channel  rate,  the  coding  system  operates 
at  the  appropriate  channel  rate. 

The  logical  operational  approach  is  to  code  the  entire  prebuffer  and  to 
place  the  output  into  the  "postbuffer."  13 y design,  the  postbuffer  is  tilled  it 
the  ratio  of  memory  sizes  is  equal  to  the  appropriate  ratio  associated  with 
the  image  compression.  The  compressed  data  may  be  transmitted  through 
the  channel  at  its  fixed  rate.  A short  delay  will  develop  corresponding  to 
coding  the  image  segment  that  fills  the  prebuffer.  For  a practical  imple- 
mentation. two  memories  could  be  used.  While  compressed  data  is 
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transmitted  from  the  equalizer  buffer,  another  compressor  operates  on 
another  identical  prebuffer.  The  delay  is  also  a result  of  the  finite  time 
required  for  classification.  The  reformatting  operation  time  can  be 
minimized.  Each  line  segment  can  be  placed  in  the  proper  location  of  the 
prebuffer  in  real  time.  Thus,  after  the  last  segment  is  available,  the  coder 
can  immediately  begin  its  operation. 

However,  the  controller  must  look  at  the  entire  image  region  residing 
in  the  reformatting  buffer  in  order  to  arrive  at  the  correct  control  parameter. 
Hence,  the  engineering  design  must  take  into  consideration  two  types  of 
delays.  One  delay  is  associated  with  image  line  reformatting  into  subblocks. 
The  additional  delays  are  required  by  the  controller.  Thus,  the  controller 
should  be  fast,  because  during  the  operation  additional  buffering  arrangement 
must  be  made  to  accept  incoming  data. 

Increasing  the  region  over  which  the  algorithm  is  adaptive  can  be  accom- 
plished only  by  increasing  the  reformatting  buffer.  For  example,  to  double 
the  adaptivity  area,  the  prebuffer  size  must  also  be  doubled.  It  should  be 
noted  that  the  prebuffer  contains  the  image  in  full  resolution.  An  increase 
in  prebuffer  size  refers  to  the  original  image  representation.  The  smallest 
prebuffer  size  is  determined  by  the  minimum  needed  for  reformatting.  It  is 
n lines  for  an  n X n transform  coder. 

The  general  concept  of  prebuffering  may  be  understood  by  referring  to 
Figure  10.  Several  nonoverlapping  image  segments,  which  reside  one  at 
a time  in  the  prebuffer,  are  shown  with  the  appropriate  D vs  R figures. 

For  each  image  segment,  a distortion  measure  is  associated  with  the 
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Figure  10.  Variable  Rate  Coding  through  Prebviffering  (Continued). 

[The  coding  system  is  shown  in  (c).  The  decoding  sys- 
tem is  shown  in  (d).  ] 


i i pri'sM.m  rati-.  I'he  dist  rtmn  paramet.  r n ay  be  isidered  as  a global 
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A 'teutiaih  ".e  gat  iv  e aspect  at  prebutter  utili  atien  is  image  qua  l it  v 
diseent inuit v between  adjacent  image  segments.  Ibis  statement  is  more 
understandable  through  a specific  simple  exan  pi  i- . A snail  area  i \ . r 
responding  to  a transtorm  blink,  is  presont  in  two  different  image  regtoi  . 
which  oeeupied  the  reformatting  butter  at  different  tir  .-s  B\  ass1.;: 
one  region  is  busy  and  the  oilier  one  is  emi  t.  In  the  busv  regie  . ivt  • 
bits  will  be  available  for  \ than  in  the  quiet  image  region.  If  the  s.r  sub 
blocks  are  also  adjacent  to  each  other,  the  apparent  image  qualm  a-s  • .to  . 
with  then  will  be  significantly  different,  because,  in  one  case,  the  i oiler 
allocated  more  bits  to  the  same  subblock  than  it  did  in  the  other  case.  I bis 
example  is  illustrated  in  figure  It. 

I he  prebuffer  segments  the  image  into  nonovo  rl.ippmg  regions  corre- 
sponding to  the  reformatting  buffer  si.  e or  its  multiple,  ll  pertorms  a rate 
controlling  function  for  ouch  rionoverlunning  region.  \ controller  design 
through  a reformatting  butter  is  inflexible.  l'his  system  is  hoaeilv  image 
format  dependent.  Furthermore,  it  is  not  easily  adaptable  to  an  image 
coding  application  for  varying  image  sizes. 


| hi  'rcbulforing  i oncept  cl  iminates  tin*  ciitl  iculties  a ss  nc  lated  u ith  an 
open  loo  p < i p<  ■ r .it  ion . I lowever,  di  1 1 icult  i es  and  di  sadvanl  ages  remain  with 
a prebuffer  ut ili.'.ation.  Another  buffering  approach,  postbuffering,  eliminates 
t the  indicated  disadvantages. 

In  Figure  Id.  i nstbutfer  control  communication  system  utilizing 
adapt  ice  transtorm  iodine,  is  illustrated.  It  should  be  no  led  that,  although 
advantages  are  highly  applicable  to  transform  codint;,  this  approach  to  adap 
live  coding  is  general.  l ints,  the  same  concept  is  equally  applicable1  to 
I ransiorni  and  other  toiling  proc  rdures  who  rr  a 1 ccdhack  is  i n iplement  abl  e . 

1 ' t i li /.at  ion  of  a rate  oqnali  cr  buffer  with  feedback  is  not  now.  However,  the 
mode  of  feedback  mechanism  implementation  is  novel. 

I' he  actual  solution  involves  modification  of  the-  distortion  parameter 
based  on  an  in  ago  region  that  ptvvioush  -esided  in  the  equal  i /.or  butter. 

[he  earlier  introduced  representation  of  the  P vs  R curve  will  he  utilized. 

1 , ointmller  constantly  adjusts  the  appropriate  distortion  parameter  to 

yield  the  required  rati-. 

i ie  basic  problem  is  how  to  deal  w itli  uonstat i ona r ity  for  overlapping 
image  regions.  For  each  region,  one  must  approximate  the  P vs  R curve, 
iubsequent  Iv , tin-  corred  distortion  parameter  to  yield  the  appropriate  rate 
is  determined.  By  design,  adjacent  image  regions  signiticantlv  overlap. 

For  example,  two  adjacent  image  regions  (sac.  A and  It)  contain  a common 
area  exceeding  ” ' pe  r c en  I lor  each  s egm  en  I . 

1 . M-  this  example,  the  appropriate  1 ' y s K curves  will  only  va ry  a small 
amount  between  the  two  segments.  I his  result  is  expeeted  since,  except  tor 
1 1*  a e 1 1 ( in  o t each  segment,  the  two  are  identical. 
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figure  12.  Variable  Kate  Coding  through 
I’ostbuffering.  l l'he  great’v 
overlapping  image  segments  of 
(a)  result  in  almost  identical 
O vs  K curves  in  (b).  | 
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One  may  assume  that  the  appropriate  distortion  parameter  is  available 
for  segment  A.  Within  that  segment,  utilization  of  the  appropriate  11  para- 
meter yields  the  required  rate.  Note  that  the  local  fluctuation  is  not  con 
strained.  One  attempts  to  have  a global  control  parameters  for  segment  A. 
which  on  the  average  yields  the  correct  rate.  For  segment  R,  it  is  reason- 
able to  utilize  the  same  distortion  parameter.  Therefore,  by  design,  the 
controller  chooses  that  distortion  term  for  segment  B.  The  coder  will  pro- 
cess the  input  for  a small  fraction  of  that  segment. 

At  the  conclusion  of  coding  of  the  last  small  segment,  the  average  rate 
over  the  past  image  segment  will  differ  from  the  desired  rate.  1 hus.  the 
rat»-  equalizer  buffer  acquires  a small  bias.  Consequently,  the  distortion 
parameter  must  be  modified  to  adjust  the  rate  properly.  Furthermore,  the 
coder  must  also  be  adjusted  to  counteract  the  bias  in  the  rate  equalizer  butter. 

Operationally,  one  does  not  have  the  11  vs  R curve.  However,  its  local 
history  is  obtained  in  terms  of  samples  on  the  curve.  Through  those  samples, 
the  coder,  based  on  the  past,  can  estimate  the  desired  H value  bv  linear 
interpolation  utilizing  the  two  previous  elements  on  the  H vs  R curve.  I he 
estimation  could  also  be  extended  to  the  differential  ot  this  curve.  I he  given 
approach  could  be  implemented  bv  methods  ot  Kalman  filtering!  however, 
this  has  not  yet  been  attempted. 

The  estimation  procedure,  in  general,  requires  a local  estimate  of  both 
the  distortion  parameter  and  its  change  ot  rate  with  respect  to  R.  Since,  for 
a practical  communication  system  the  differential  will  have  the  same  sign,  a 
simplified  estimator  could  be  utilized. 
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The  indicat i'(l  conceptual  procedure  is  a buffering  technique  th.it  utilizes 
the  important  parameters,  i.  e.  , the  buffer  size  and  its  projections  into  the 
image.  this  technique  is  stable  because  variations  in  buffer  status  are 
immediately  ted  back  into  the  control  operation. 

Before  specific  implementations  are  considered,  the  general  usefulness 
of  the  discussed  approach  can  be  compared  with  the  controlling  operation 
through  prebuffering.  Ihe  coding  mechanism  via  postbuffering  is  highly 
flexible.  Other  than  the  reformatting  buffer,  the  system  is  completely 
format  independent.  The  reformatting  buffer  performs  no  cont  rolling 
operation.  Consequently,  the  coder  can  be  utilized  for  different  image  for- 
mats by  modifying  the  reformatter. 

This  consideration  may  be  important  it  a general  adaptive  transform 
coder  is  to  be  developed.  Since  this  system  will  be  of  considerable  com- 
plexity. it  would  be  desirable  to  use  the  same  design  for  various  applications 
including  different  image  types  and  formats. 

The  coder  operates  with  one  block  of  data  at  one  time;  therefore,  the 
postbuffering  concept  can  be  extended  to  include  several  multiplexed  coders 
that  utilize  a single  postbuffer.  The  postbutter  controls  each  coder,  k.ach 
transform  block  is  treated  independently  by  the  coder. 

Similarly,  the  coder  may  operate  on  subblocks  in  a nonsequential  order. 
The  rate  fluctuation  from  block  to  block  will,  of  course,  differ  for  an  alter- 
nate ordering;  however,  the  coding  efficiency  should  remain  the  same. 

This  consideration  is  not  entirely  artificial.  Transform  coding  requires 
a considerable  number  of  arithmetical  operations.  At  video  rates,  this 
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requirement  may  bo  quite  demanding.  In  order  to  reduce  the  effective 
computation  rate,  several  coders  may  multiplex  their  output  into  a single 
rate  buffer. 

Another  observation  is  that,  unlike  the  case  of  prebuffer  control,  no 
discontinuity  in  image  quality  among  different  image  regions  develops  as  a 
result  of  adaptive  coding.  The  distortion  parameter  D is  updated 
continuously.  Two  similar  adjacent  subblocks  are  not  likely  to  be  coded 
differently  because  of  discontinuity  in  the  l>  parameter.  To  minimize  the 
problem  of  image  quality  discontinuity,  the  postbuffer  should  bo  sufficiently 
large,  say,  two  or  more,  image  strips. 

Memory  utilization  is  improved  through  postbuffer  control  compared 
with  prebuffering.  The  postbuifor  control  operates  through  the  compressed 
image,  unlike  the  prebuffer  control.  I'o  improve  adaptivity  with  postbuffer 
control  is  loss  demanding,  since  the  necessary  buffer  increase  is  in  terms 
of  a compressed  image. 

Another  advantage  of  postbuffer  control  is  parallel  controller  operation. 
Hv  design,  this  function  may  be  simultaneously  performed  with  the  coding 
operation.  Consequently,  the  requirement  for  additional  buffering  to  permit 
the  controller  to  determine  the  appropriate  distortion  parameter  is  not 
necessary . 

The  controller  updates  the  distortion  parameter  in  parallel  significantly 
less  frequently  than  the  actual  arithematic  operation  rate  required  for  the 
coder.  Thus,  the  computational  requirement  for  the  controlling  system  is 
limited.  It  should  also  be  noted  that  the  buffer  ctmtrol  is  strictly  based  on 


butter  behavior,  and  the  required  information  is  based  on  previously  trans 
mitted  data . Sitiee  the  decoder  performs  the  same  turn  lion,  the  entire  con- 
trol operation  can  be  performed  with  no  overhead. 

S.  L,K ARMING  PROCFDURKS 

Compression  algorithms  yyith  separate  adaptivity  classes  require  pre- 
training prior  to  actual  coding,  for  these  algorithms,  a parameter i/.ation 
is  required  in  order  to  relate  to  local  image  structure.  In  order  to  perform 
the  adaptive  coding,  it  is  necessary  to  develop  techniques  which  meaningfully 
assign  the  mode  of  the  image  coder  into  one  ot  available  classes. 

In  general,  it  is  necessary  to  develop  training  procedures  that  yield  the 
appropriate  coding  classes.  1'hus,  prior  to  the  actual  coding,  a training 
procedure  must  be  implemented.  I'he  result  ot  this  training  of  the1  coder 
must  also  be  available  to  the  decoder  as  well.  It  requires  considerable' 
overhead  to  transmit  this  information.  Consequent ly,  the  training  procedure 
is  not  likely  to  be  performed  frequently. 

Alternatively,  several  types  of  training  images  may  be  available  to  both 
coder  and  decoder.  When  coding  a specific  image  prior  to  transmission  of 
the  compressed  image,  the  required  parameters  arc-  specified  by  indicating 
the  appropriate  training  image. 

Performance  variation  ot  an  adaptive  technique  based  on  classification 
procedures,  which,  in  turn,  are  based  on  "typical"  image  identification, 
makes  evaluation  of  the  adaptive  algorithm  difficult.  First,  one  must  eval- 
uate the  capability  and  performance  of  the  algorithm.  Second,  the  algorithm 
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sensitivity  to  the  specification  of  typical  image  sots  should  be  considered. 

Thus,  an  efficient  compression  algorithm  can  become  suboptimal.  The  im- 
plementation can  be  significantly  degraded  if  a mismatch  develops  between 
the  image  to  be  coded  and  the  image  which  was  used  for  training. 

Adaptive  techniques  with  a fixed  number  of  operating  modes  are  attrac- 
tive. The  overall  system  complexity  must  include  the  training  procedures 
and.  consequently,  it  becomes  rather  involved.  An  equivalent  adaptive 
technique  that  eliminates  the  training  procedure  is  discussed  in  Section  4.5. 
Although  the  source  coder  complexity  increases,  the  overall  system  com- 
plexity is  reasonable.  Besides  a higher  degree  of  coder  adaptivity,  the  need 
for  training  procedures  is  eliminated. 

6.  C IjASSIFIEIRS 

In  this  section,  the  discussion  is  limited  to  adaptive  techniques  which 
utilize  a finite  number  of  classes.  For  each  class,  a different  coding  mode 
is  utilized.  The  conceptual  difference  should  be  noted  between  a classification 
process,  which  is  a response  to  local  image  structure  fluctuation  over  sub- 
block size,  and  buffer  control.  The  latter  function  operates  over  much  larger 
image  regions.  It  introduces  a small  perturbation  in  the  coding  process  to 
achieve  the  necessary  fixed  rate  output  averaged  over  many  subblocks. 

To  perform  the  classification,  the  coding  algorithm  performs  some 
measurement  and.  through  an  appropriate  deterministic  procedure,  each 
subblock  is  assigned  to  one  of  the  finite  number  of  classes.  The  considerable 
amount  of  research  in  pattern  recognition  performed  to  date  has  had  little 
or  no  impact  on  transform  image  coding  algorithms. 
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Two  types  of  simple  classifiers  have  been  developed.  Only  one  has  been 
used  extensively.  The  first  classification  process  lias  utilized  the  subblock 
energy  as  primary  input  to  the  classifier  algorithm.  1 asto  and  Wintz  (19*  1) 
as  well  as  Chen  and  Smith  (1976)  utilized  this  local  energy  concept  to  perform 
the  classification  process.  In  both  cases,  the  local  AC  energy  (e.  g.  . pixel 
variance  subblocks)  is  deterministically  mapped  onto  a small  set  of  integers 
representing  the  finite  number  of  classes. 

The  energy  concept  appears  reasonable.  It  is  indicative  of  subblock 
business.  For  a unitary  transformation,  the  energy  is  an  invariant  para- 
meter. However,  energy  in  the  transform  domain  is  primarily  represented 
by  a few  transform  low  order  coefficients  only.  Consequently,  an  energy - 
based  classification  procedure  does  not  properly  assign  the  available  band- 
width among  the  various  classes,  since  the  relevant  decision  is  magnitude 
of  a few  low  order  coefficients. 

A recently  proposed  classification  procedure  utilizes  the  sum  of  the 
logarithms  of  coefficient  magnitudes  (Melzer,  1977).  This  quantity  can  be 
interpreted  to  be  image  entropy.  Thus,  the  classification  decision  is  made 
on  local  entropy. 

It  should  be  noted  that  zero  order  entropy  in  the  transform  domain 
closely  approximates  image  entropy,  since  the  coelf icients  are  nearly  de- 
correlated.  For  this  second  procedure,  the  classifier  specifies  classes  m 
accordance  with  local  entropy. 

Classification  includes  bit  allocation  as  well.  The  classifier  specifies 
two  sets  of  matrices,  representing  the  normalization  factors  for  coefficients 
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and  the  appropriate  bit  assignment.  The  latter  two  quantities  can  be  deter- 
mined independently  through  training  procedures.  For  an  n"  class  classi- 
fier. normalization  matrices  and  n bit  assignment  matrices  are  required. 
Alternatively,  first  the  normalization  matrices  are  obtained.  The  bit  assign- 
ment matrices  are  derived  from  the  normalization  matrices  according  to  the 
standard  rule  (Habibi  and  W'intz,  197  1).  The  distortion  parameter  (D)  is 
determined  such  that  the  required  number  of  bits  is  achieved.  Specific 
implementations  are  considered  in  Section  K.  1. 

Although  the  concepts  discussed  are  independently  straightforward,  tin- 
various  quantities  must  be  related  to  each  other.  The  classification  must 
be  jointly  considered  with  different  buffering  procedures. 

Classification  with  subblock  energy  can  be  performed  prior  to  trans- 
formation. Consequently,  it  can  be  performed  as  part  of  tin-  controlling 
function  for  buffer  control  using  the  prebuffering  concept. 

For  classification  in  the  transform  domain  using  local  entropy,  the  same 
procedure  becomes  rather  involved  with  the  prebuffering  concept.  The  pri- 
mary problem  is  to  perform  buffer  control.  It  is  necessary  to  operate  simul- 
taneously on  all  blocks  residing  in  the  prebuffer.  Before  the  controlling 
function  can  be  implemented,  all  subblocks  must  be  transformed.  therefore, 
the  requirement  develops  for  another  large  buffer  in  which  the  transform 
blocks  of  the  prebuffer  must  bo  stored. 

However,  if  rate  control  is  achieved  by  the  postbuffering  technique,  the 
requirement  for  the  additional  large  buffer  does  not  exist.  Postbuffer  control 
can  be  equally  well  implemented  through  spatial  and  transform  domain 


classification.  For  the  present  discussion,  the  emphasis  is  not  on  arithmetic 
complexity,  but  rather  on  required  buffer  size. 


K.  IMPLFMFNTATIONS  OF  ADAPTIVE  TRANSFORM  COPING  SYSTKMS 
Previous  subsections  reviewed  basic  concepts  relevant  to  transform 
image  coding.  Specifically,  the  required  concepts  for  adaptive  transform 
coding  were  considered.  It  should  be  reemphasized  that  adaptivity  is  neces- 
sary to  justify  the  additional  complexity  of  transform  coding.  However,  to 
date,  only  a limited  number  of  transform  coding  systems  have  been  studied. 

In  particular,  no  variable  rate  transform  coding  design  has  been  developed 
to  the  hardware  stage.  However,  several  computer  simulated  systems  have 
been  studied. 

In  this  section,  these  designs  are  reviewed.  Specifically,  fullv 
adaptive  transform  coding  procedures  with  a rat  equalizer  buffer  are 
discussed.  The  emphasis  is  on  implementation  concepts.  The  discussion, 
here,  relates  to  transform  techniques.  However,  it  should  be  pointed  out 
that  the  sante  concepts  are  general  and  are  also  applicable  to  other  coding 
techniques . 

Prior  to  the  discussion  of  mathematical  models  for  adaptive  transform 
coding,  specific  definitions  of  image  segmentation  are  required. 

Four  image  region  types  are  considered.  The  entire  image  is  referred 
to  as  the  "frame."  The  smallest  grouping  of  pixels  used  is  a "block.’'  The 
block  is  the  basic  input  to  the  transform  coding  algorithm.  It  corresponds 
to  the  pixel  set  over  which  the  transformation  is  performed.  The  "strip 
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refers  to  a set  of  blocks,  which,  in  general,  corresponds  to  n image  lines 
for  n x n block  size.  Transform  coding  can  be  implemented  with  nonsquare 
blocks.  However,  to  simplify  the  discussion,  only  square  blocks  are 
considered.  The  strip  also  corresponds  to  the  minimum  number  ot  sub- 
blocks required  to  perform  the  reformatting  function. 

The  "region"  corresponds  to  a relatively  small  image  area  over  which 
the  adaptive  coder  operates  without  additional  control  information  t rom  the 
buffer  control  algorithm.  The  region  size  can  be  a block  or  several  blocks. 

These  image  subelements  are  utilized  for  adaptivity,  adaptivity  control, 
and  local  transformation. 

1.  MEMORY  ARCHITECTURES 

A transform  coding  algorithm  operates  over  transform  blocks.  Thus, 
the  conventional  image  raster  format  requires  reformatting  inti'  blocks.  For 
variable  rate  adaptivity,  in  addition  to  a reformatting  buffer,  a rate  equalizer 
buffer  is  included. 

Memory  architecture  associated  with  the  indicated  buffers  is  simple. 

For  application  with  large  size  images,  these  buffers  may  become  important. 
Image  compression  with  large  formats  also  require  large  size  buffers. 

While  operation  complexity  remains  simple,  the  large  buffers  require  consid- 
eration of  expense  and  reliability. 

Reliability  is  particularly  important  for  the  equalizer  buffer  that  contains 
code  words.  An  error  within  the  rate  equalizer  buffer  is  equivalent  to 
channel  error. 
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The  buffer  that  perforins  the  reformatting  function,  although  rot  trulv 
random  access,  requires  tliat  memory  input  and  output  addressing  be  per- 
formed ditiererv.lv.  lire  rate  equalizer  buffer  mat  be  onsidered  sehemati 
■.ally  as  a two-sided  memory.  At  the  forward  side,  the  compressor  inputs 
code  words.  From  the  other  side,  tin  channel,  at  fixed  bit  rate,  removes 
code  words. 

Depending  on  adaptivity'  implementation,  a con t red  : unction  is  associated 
with  one  of  the  two  buffers.  1 he  information  required  tor  adaptivity  pro- 
cessing is  external  t ' i buffers.  For  example,  the  pre buffering  concept 
tor  adaptivity  control  requires  block  energy,  which  can  be  di  - rmitu  <:  on 
input  to  the  memorv.  C >nsequentlv,  no  memory  access  is  re<  uired  after  the 
appropriate  image  area  is  placed  into  this  buffer. 

bi  nilarly,  for  postbuffer  control  no  requirement  exists  to  access  data 
:!  c buffer.  The  buffer  c ontrol  requires  the  buffer  fullness  information 
- 1 ic 1 . again,  is  determined  externally  to  the  buffer.  the  buffer  fullness 
1 'ained  through  monitoring  ti  e number  of  bits  into  the  butter  and  into  the 
c i-  mi.  The  difference  bet\v<  en  the  two  is  the  buffer  fullness  required  for 
adaptivity  control. 

d.  SPECIFIC  DESIGNS 

In  tiiis  section,  several  approaches  to  adaptive  transform  coding  are 
discussed.  through  these  techniques,  actual  implementation  is  related  to 
theoretical  concepts  previously  developed. 

First,  discussion  is  on  open  loop  techniques.  An  open  loop  transform 
• ding  pr  i eduro  is  a fully  adaptive  algorithm  with  no  requirement  to 
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constrain  tin-  bit  fluctuation  within  an  equal  i/.er  buffer.  Cnnscquentlv,  no 
de  tor  minis  tic  procedure  maintains  the  algorithm  at  a specified  bit  rati'. 

i'tvo  techniques  have  been  developed  previously.  Since  these  techniques 
cannot  easilv  operate  in  a standard  communications  environment  that  requires 
fixed  rate  throughput,  they  are  mainly  of  interest  for  historical  perspective. 
However,  these  techniques  also  serve  as  a motivation  to  develop  the  required 
modifications  which  would  permit  them  to  operate  at  tixed  rate. 

The  first  open  loop  technique,  the  algorithm  of  Tasto  and  Wintz  (1971), 
utilized  the  K-L  transform.  Subblocks  were  partitioned  (classified)  accord- 
ing to  energy.  Four  classes,  each  requiring  a two-bit  overhead,  were  used, 
l he  implemented  quantization  strategies  included  a combination  of  a uniform 
quantizer  with  Huffman  coding.  lhe  paper  bv  Tasto  and  Wintz  strongly 
motivated  this  author  to  studs  the  practical  implementations  of  Huffman 
coding  in  the  transform  domain.  Algorithm  development  has  not  considered 
what  was  earlier  referred  to  as  training  procedures.  Both  training  and 
coding  were  performed  on  the  same  image.  Consequently,  the  problem 
associated  with  maintaining  channel  rate  from  one  image  to  another  and  the 
mismatch  problem  were  avoided.  The  results  demonstrated  that  adaptis’its 
is  a viable  concept . l he  Tasto  and  Wintz  paper  was  the  first  systematic 
studs  of  an  adaptive  transform  coding  procedure. 

Another  open  loop  technique  was  developed  by  1’escher  (197  a).  Hie 
adaptive  coding  procedure  eliminated  the  need  for  class  it’ic  at  ion . lhe  algo 
rithrv  utilized  large  transform  blocks  T -o  x JHo).  l he  ness-  concept  seas  to 


derive  recursively,  based  on  previously  decoded  information,  both  the 
normalization  terms  and  the  bit  assignment. 


Superficially,  this  technique  is  similar  to  conventional,  two-dimensional 
differential  pulse -code  modulation  (DPCM).  However,  unlike  DPCM,  rather 
than  the  actual  quantity,  its  variance  is  estimated.  Ihus,  the  appropriate 
estimation  procedure  obtains  the  standard  deviation  of  the  transform  coeffi- 
cient to  be  coded.  Since  only  previously  coded  values  are  used  in  the  esti- 
mation, the  procedure  is  fully  decodable.  The  decoder  repeats  the  same 
steps  of  the  coding  process. 

The  described  technique  is  fully  adaptive.  It  adapts  to  the  transform 
domain  structure.  The  compression  rate  is  determined  on  actual  image 
activity.  The  developed  technique  is  "self-truncating.  ' When  the  predicted 
number  of  bits  for  a coefficient  falls  below  unity,  no  further  information  is 
transmitted.  In  that  fashion,  at  that  point,  and  for  that  appropriate  image 
line  in  the  transform  domain,  the  coding  is  terminated.  This  open  loop 
technique  for  large  transform  sizes  has  been  successful  for  various  applica- 
tions including  monochrome  color  and  interframe  coding. 

The  two  fundamental  new  concepts  established  deserve  to  be  emphasized. 
The  first  result  is  that  a successful  adaptive  transform  coding  may  be  de- 
veloped without  a priori  assumed  fixed  model,  such  as  the  Markov  model. 

More  importantly,  a procedure  has  been  developed  which  permits  real  time 
model  identification  through  which  the  appropriate  bit  assignment  procedure 
can  also  be  implemented.  Through  the  second  concept,  an  algorithm  was 
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do vv l opoil  that  utilizes  tho  available  bandwidth  according  to  need.  Unfortunately, 
the  discussed  algorithm  is  impractical.  The  utilization  ot  largo  transform 
sizes  and  open  loop  techniques  is  inappropriate.  Tho  next  step  was  to  de- 
velop an  implementation  which  is  tullv  adaptive,  yet  utilizes  small  transforms. 

I he  author  extended  the  earh  adaptive  concepts  to  more  conventional  trans- 
form siz.es. 

Basic  features  of  the  algorithm  are  presented  in  Figure  1 >.  A recur- 
sive procedure  is  utilized.  Starting  with  the  largest  coefficient,  the  algorithm 
recursively  predicts  the  next  c .efficient  standard  deviation  from  who  h the 
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is  mapped  in  a zig-zag  fashion  to  yield  a one -dimens  ional  function.  1 he  sub- 
diagonal  elements  have  approximately  the  same  relative  importance.  thus, 
the  mapping  results  in  a monotonic  all  v decreasing  set  of  elements.  Several 
examples  are  shown  in  Figure  1.5.  Here,  the  absolute  coefficient 
values  on  a logarithmic  scale  are  given. 

I'he  demonstrated  procedure  is  fully  adaptive.  The  normalization 
coefficients,  the  bit  assignment,  as  well  as  the  total  number  of  bits  per 
block,  are  adaptively  determined.  i'he  algorithm  is  1 sol  t - to  rminat  ing  . 

When  the  predicted  number  of  bits  tor  a coetticient  tails  below  a tixod  value, 
say  two,  the  coding  is  terminated  for  that  block.  Similarly,  the  receiver 
duplicates  the  recursive  procedure;  thus,  tho  process  is  decodable. 

The  outlined  procedure  appears  to  be  promising.  However,  its  unde- 
sirable feature  is  the  open  loop  operation.  the  coding  algorithm  properly 


ii/art;  13.  Adaptive  Transform  Coder,  [fa  I The  ba 
of  the  coefficient  reordering.  The  dc  tc 


r.wv- 


m a unification 


Ar  # I H 
I I I * 
»-  k • U 
t ft! 

. t t ku 

• .lift 

'•  | t.ftl 

• h ►.  i 


W ^ - 


»»iiMUkkiniuif 

lli*kii.kkl.k».fck.  k * 1 1 
•kkktfci kkkkit. lilt 
!■  • » l k * > . i t k»  . k k k t 
IM  »**■*#•*'♦>  . l.llili 
ft*  ill)  *.*#*».  ..  k » » ft  ft 
kl  l • ft. Hr  m-0  4 f #■  » ft  r *'  # ftJ 
.ftl..L.k_*.k_._ift./  I It  . . t tit 
■rl  * ft*.  . . ir  • « »>  .•  * «•  ft  t.1 


indicates  that  the  coefficient  ■ 


allocates  bits  according  to  the  relative  importance  (c.g..  image  entropy  ) ot 
subblocks.  However,  channel  rati’  is  not  considered. 

The  next  logical  step  is  to  convert  the  open  loop  technique  to  a closed 
loop  procedure.  Through  the  utilization  of  the  previously  discussed  general 
rate  control  approach,  the  solution  is  straightforward.  Hut  tirst.  techniques 
based  on  classification  procedures  should  be  discussed. 

Chen  and  Smith  (19  70)  developed  a closed  loop  adaptive  procedure  which 
is  a simplification  of  Tasto's  technique.  The  Chen-Smith  coder  utilized  the 
cosine  transform.  Based  on  subblock  energy,  a four-class  partitioning  is 
performed.  Prebuffer  control  maintains  the  required  bit  rate. 

The  concept  is  as  follows.  The  coefficient  energy  of  subblocks  within 
the  reformatting  buffo  r is  measured  and  classified.  The  four-class  classifier 
requires  three  thresholds  in  subblock  energy.  For  each  class,  subblocks 
are  given  a predetermined  number  of  bits.  The  distribution  function  for 
subblock  energy  is  determined.  For  the  proper  thresholds,  the  product  of 
the  given  class  bit  rate  and  the  integral  of  the  distribution  function  summed 
over  the  four  c lasses  yields  the  required  rate,  as  shown  in  Figure  11. 
Consequently,  the  number  of  bits  tor  the  various  classes  permits  straight 
forward  computation  of  the  required  thresholds.  The  necessary  computation, 
although  simple,  must  be  performed  for  each  data  set  that  resides  in  the 
reformatting  buffer  For  a real  time  system,  the  computation  requires 
addit iona  l buffo  ring. 

In  their  paper.  Chen  and  Smith  (1976)  effectively  argued  the  benefits  of 
adaptive  coding.  Through  various  experiments,  they  identified  a typical 
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Figure  14.  Subblock  Classification  Based  on  Energy 
(Chen  and  Smith,  197b).  (The  four  indi- 
cated regions  partition  the  available 
bandwidth.  ) 


probabil  it  y di  s t r i but  ion  for  subblock  activity.  1'hey  observed,  consistent 
with  this  author  s experience,  that,  for  many  intake  ty  pes,  the  indicated 
probability  density  is  an  approximately  exponentially  decreasing  function  ot 
energy . 

1'liis  i>bs e rv ation  indicates  that,  in  general,  most  subblocks  are  relatively 
quiet.  Only  a small  number  of  subhlocks  contain  sipniticant  activity  , t on- 
sequentlv,  a stronp  motivation  tor  adaptivity  is  availably'.  1 lie  coding  algo- 
rithm at  relatively  loyy  rates  is  acceptable  for  a lame  fraction  ot  imatje  sub- 
blocks.  ,\  small  additional  bandyy  idth  is  required  to  an  oiuntodate  tin1  small 
number  of  active  subbloy'ks.  1 lie  c odinp  eypt’i'iiuent  ot  C hen  anil  Smith 
included  monochrome  as  yyell  as  color  images. 

Unlike  1'asto  and  U'int.i  (1^71),  i 'hen  and  Smith  did  not  consider  entropy 
online  of  tin-  transform  domain.  l asto  anil  Wintz  utilised  tin'  appropriate 
K-L  transform  for  each  class.  In  contrast,  the  implementation  by  Chun  anil 
Smith  utilized  the  cosine  transform  for  all  classes.  Ihe  probable  benetit 
of  a spec  ltu  transform  for  each  class  is  ma  rental 

I’he  techniques  of  both  Chen  and  lasto  share  a common  limitation.  Ihe 
classifier  uses  floating  thresholds  to  arrive  at  the  tixeil  rate.  Consequently, 
the  energy  c 1 a s s i fi  ca  t i on  is  adaptive  for  each  subblock.  Hoyy  a particular 
subblock  is  classified  depends  on  the  subblock  energy  distribution  for  the 
intake  region  which  is  utilized  for  threshold  determination. 

Cox  and  lescher  lU'Tt-'  dev  eli'pt'il  an  absolute  clas  sification  procedure 
o’  tin1  closeil  loop  format.  Ibis  y'lassifier  utilizes  tixi'd  preass  ljfneil  thres'olds. 


Thus,  each  subblock  is  classified  according  to  its  absolute  energy  without 
regard  to  the  distribution  of  neighbor  subblocks. 

To  maintain  a fixed  average  channel  rate,  bit  assignment  matrices  are 
recalculated  for  each  image  strip.  Figure  I1'  indicates  the  appropriate 
constraints.  Determination  of  the  D parameter  is  made  such  that  summation 
over  all  bit  assignment  matrices  results  in  the  total  allowed  number  of  bits 
for  that  image  segment  associated  with  the  adaptivity  computation.  One 
could  argue  the  superiority  of  this  technique:  an  image  subblock  is  paired 
with  the  appropriate  normalization  matrix  regardless  of  the  distribution  of 
other  subblocks  in  the  adaptivity  region. 

A major  criticism  of  previous  techniques  is  that  classification  is  based 
on  a single  parameter  which  can  provide  oul\  limited  information  on  image 
structure.  Subblock  entropy  utilization  should  yield  superior  classification. 
The  problem  still  remains  how  tit  model  the  normalization  matrix  for  a class. 

The  next  logical  approach  is  to  implement  the  recursive  fully  adaptive 
technique  over  small  subblocks  with  postbulfer  feedback  control.  The  result 
is  a self -consistent  transform  coding  procedure.  This  technique,  developed 
by  this  author,  is  discussed  next. 

Rased  on  the  foregoing  discussion,  the  combination  of  general  rate 
control  and  the  open  loop  adaptive  transform  coding  is  relatively  simple. 

The  procedure  is  to  develop  a controller  that  specifies  to  the  previously 
open  loop  technique,  the  D parameter. 

Itt  Figure  lo,  a flow  chart  is  given  for  the  appropriate  logic.  The 
same  figure  lists  the  parameters  nocessart  to  implement  the  algroithm. 


Figure  1 ^ 


Flow  Diagram  of  an  Absolute  Classification 
Procedure  (Cox  anti  Tescher,  10  7b).  I The 
following  definitions  apply:  n^  is  the  num- 
ber of  subblocks  classified  in  the  k-th  class 
according  to  absolute  energy.  The  algo- 
rithm determines  dynamically  bp)(k),  the 
number  of  bits  for  coefficient  i,  and  j in 
class  k.  I 
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' i i r e 16.  Buffer  Feedback  luOi>  ic  . [(a)  The-  basic  system,  (b)  Parameter  definitions. 
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The  initial  condition  must  be  separately  specified.  For  the  first  subblock 
to  be  coded,  an  assumed  distortion  (D^)  is  specified  which  corresponds  to 
the  desired  average  rate.  The  indicated  algorithm  is  stable.  Even  for  un- 
reasonable initial  conditions,  the  controller  rapidly  stabilizes  at  an  effec- 
tively steady  state  condition. 

Conceptually,  the  controller  attempts  to  maintain  the  buffer  bias  (B) 
within  the  available  buffer.  Consequently,  the  maximum  deviation  from  a 
specified  rate  is  negligible  for  image  sizes  much  larger  than  the  postbuffer 
size . 

The  postbuffer  control  size  has  a limited  impact  on  performance, 
although  it  is  small.  Computer  simulations  have  indicated  a small  image 
quality  degradation  in  terms  of  mean  square  error  as  the  postbuffer  size  was 
reduced  from  two-image  strips  to  one-fourth  of  an  image  strip. 

As  discussed  in  Section  D.4,  this  technique  can  utilize  parallel  com- 
putation. and  it  is  flexible  in  image  format.  Various  examples,  including 
buffer  performance  behavior,  are  shown  in  Figure  17.  It  should  be  noted 
that  the  various  parameters  behave  reasonably,  and  no  sign  of  an  unstable 
buffer  behavior  develops.  The  intermediate  controller  parameters  are  also 
shown.  They  are  the  "desired''  local  rate  and  the  estimated  and  required  D 
parameters.  The  estimated  parameters  represent  quantities  averaged  over 
large  image  segments.  These  parameters  change  only  slowly.  The  instan- 
taneous rate  associated  with  individual  subblocks  fluctuates  significantly.  At 
two  bits  per  pixel,  for  example,  the  instantaneous  rate  variation  is  likely  to 
exceed  a factor  of  four. 
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Hu*  described  procedure  is  highly  adaptive  to  local  im.ijje  structure 
and  i s ^ 1 1 t e rimnat  mg . Vet,  the  procedure  behaves  m a manner  consistent 
witli  a t uiiiinimu  atiiin  channel  for  practical  operation.  Kor  a lower  average 
!nt  rate,  while  more  image  subblocks  operate  at  a lower  instantaneous  rate, 
the  coder  still  assigns  a relatively  large  number  of  bits  to  a selected  small  set. 

Ibis  technique  is  particularly  beneficial  for  high  quality  imagery  with 
high  signal  to  noise  properties.  For  this  case,  the  various  image  regions 
are  dispersed  in  a local  image  structure.  The  algorithm  requires  minimal 
overhead.  The  coder  adapts  to  each  subblock  independently.  Since  possible 
combinations  for  different  subblocks  are  essentially  infinite,  this  procedure 
effectively  utilizes  an  infinite  number  ot  classes. 

The  algorithm  can  be  extended  to  utilize  a two-dimensional  predictor. 
However,  the  prodution  mechanism  has  to  be  more  complex  to  follow  each 
transform  line  of  rapidly  dec  axing  transform  coefficient  amplitudes. 

A maior  practical  advantage  of  the  self-adaptive  transform  coding  algo- 
rithm is  its  capability  to  operate  without  any  prior  training.  Thus,  the  same 
algorithm  can  be  equallx  applicable  for  significantly  different  image  types. 

t.  C'llANNKl.  KKUOK  CONSIO KR ATIONS 

While  consideration  of  the  channel  error  problem  is  important,  it  is  a 
separate  problem.  Kxcept  for  I lien  and  Smith  (l*l7t*l,  studies  ot  adaptive 
transform  techniques  have  not  considered  channel  correction  procedures 
Chen  and  Smith  performed  several  experiments  to  demonstrate,  not  un 
«■  xpei  t edl  \ . that  add  it  tonal  c ha  line  1 > onsnle  ration  i an  ellei  t ivelx  minimize 
the  channel  error  problem 


Channel  e r r»  r sensitivity  usually  is  directly  proportional  to  the  degree 
of  adaptivity.  In  principle.  .1  high  sensitivity  to  eliaunel  noise  may  he  taken 
advantage  ot  m the  self  adaptive  transform  lodmg  algorithm.  This  t an  be 
a 1 1 umpl  i s he  d as  follows  A single  channel  error  causes  synch  roni/alion 
loss.  llnis,  subsequent  transform  values  yield  erroneous  transform  van 
ante  estimates.  Since  the  estimated  values  immediately  lifetime  obviously 
unrealistic,  it  is  relativelv  easv  to  ulentifv  an  error.  \ simple  algorithm 
mav  monitor  the  estimated  variaiue  or  the  predieteil  bit  assignment  to  check 
it  the  algorithm  is  working  properl  v. 

For  a typical  subbloek,  the  estimated  bit  assignment  values  over  small 
transform  domain  regions  are  constant  or  tlet  reusing;  only  occasionally 
does  a small  inrriMSi'  occur  in  the  allocated  number  ot  bits  tor  the  next 
coefficient.  Consequently,  it  the  predicted  number  of  bits  begins  to  increase, 
a channel  error  is  indicated.  Once  the  error  is  detected,  the  decoder  mav 
simply  ignore  the  remaining  part  ot  the  subbloek.  Since  the  erroneous  bit 
is  closely  localized,  one  mav  attempt  to  correct  the  error  by  trial  and  error 
and  recover  the  rest  of  the  subblock.  However,  this  is  a conceptual  approach 
yet  to  be  implemented. 

•1.  APPLICABILITY  OF  CONCH PTS  I'O  O THKR  I'KCHNIOUKS 

While  it  is  not  appropriate  for  this  author  to  consider  image  coiling 
procedures  other  than  transform  techniques,  it  is  worthwhile  to  point  out 


ou 


that  soim1  of  the  discussed  concepts  of  this  section  are  directly  applicable 
with  little  or  no  modifications  to  other  coding  techniques  (Tescher  and  C'ox, 

19i7)  (e.g.,  the  self  adaptive  procedure  is  applicable  to  the  hybrid  transform/ 

DPCM  coding).  The  generalized  buffer  feedback  procedure  is  applicable  to  all 
image  coding  types  as  well  as  to  general  data  compression  procedures.  In 
particular,  the  same  feedback  logic  is  useful  for  entropy  coding  as  well  as  to 
procedures  where  entropy  coding  is  utilized  jointly  with  another  coding 
tec  htiique . 

F.  C. ON C LIJS1UNS 

This  report  has  attempted  to  demonstrate  that  transform  coding  is  a 
valuable  and  efficient  technique  for  image  compression.  However,  various 
considerations  are  necessary  before  the  transform  coding  procedure  can  be 

used  in  a practical  environment.  This  report  reviews  adaptivity  concepts  , 

followed  by  considerations  of  variable  rate  algorithms  with  the  constraint 
of  the  fixed  rate  channel. 

Several  solutions  an'  identified.  Some  of  these  solutions  are  also 
applicable  to  other  data  compression  algorithms.  Adaptive  transform  coding 
procedures  are  promising.  These  techniques  can  and  should  be  implemented 
through  a feedback  mechanism  to  control  the  rate.  The  necessary  hardware 
will  likely  exceed  what  is  needed  for  a nonadaptive  technique.  However,  the 

benefits  will  more  than  offset  the  hardware  penalty.  j 

The  required  technology  is  in  existence.  The  discussed  channel  error 
correction  procedure,  based  on  the  detection  of  catastrophic  failure  location, 
is  an  interesting  problem  that  would  probably  be  worthwhile  to  pursue. 
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' s >uld  be  again  emphasized  that  the  most  promising  problem  areas  for 
improved  efficiency  are  not  in  the  development  of  new  transform  algorithms 
but  rather  in  the  procedures  that  follow  the  transformation. 

Several  other  considerations  associated  with  transform  image  coding 
need  to  be  reviewed.  Although  not  explicitly  stated,  transform  coding  is  a 
statistical  procedure  based  on  the  decorrelation  property  of  the  transformation. 
An  alternative  philosophy  based  on  an  approximation  theory  has  been  pro- 
posed recently.  Image  regions  are  considered  to  be  two-dimensional  seg- 
ments that  the  data  compression  algorithm  approximates  by  some  known 
func  tion  set. 

Therefore,  one  could  claim  that  an  approximation  with  sinusoidal  func- 
tions may  be  suboptimal . The  relevant  question  is  whether  other  e-  -ir'innal 
approximations  of  image  segments  may  converge  faster  than  the  sinusoidal 
set.  This  question  is  discussed  with  reference  to  the  singular  value  decom- 
position technique  of  image  segments  which,  in  the  least  square  sense,  is 
optimum  (Albert,  1 1 h 

Unfortunately,  what  may  be  optimum  or  attractive  in  approximation 
theory  does  not  always  result  in  an  efficient  coding  system.  The  least 
squares  approach  requires  the  transmission  of  the  transformation  parameters 
as  well  as  of  the  appropriate  coefficients.  The  required  bandwidth  is  likely 
to  become  considerable.  For  a specific  type  of  imagery,  such  as  artificially 
generated  scenes,  an  approximation  theory  may  be  appropriate  (Andrews  and 
Patterson.  lQ7f',-  McCaughev,  1^7 b).  However,  most  imagery  can  be  charac- 
terized onlv  in  a statistical  sense.  Therefore,  an  approximation  theory  will 
not  yield  improvements  over  statistical  transform  techniques. 
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For  completeness,  analog  transform  implementation  should  he  mentioned. 
I he  CCD  technology  allows  analog  implementation  of  various  transform  types 
(Uuss,  et  al . . D'75).  This  interesting  development  is  beneficial  to  data  pro- 

cessing applications  involving  large  size  transforms.  For  realistic  trans- 
form coding  algorithms,  the  benefits  arc  marginal,  since  large  transform 
sizes  and  the  associated  high  number  of  arithematic  operations  are  not 
justified.  For  specialized  implementations  with  stringent  power  and  weight 
requirements,  an  analog  approach  might  be  appropriate. 

Another  area  which  needs  further  study  and  could  be  potentially  useful 
for  transform  coding  is  a better  understanding  of  image  quality  criteria. 

The  self-adaptive  technique  approximately  performs  classification  according 
to  transform  domain  entropy,  which  is  also  a measure  of  image  entropy. 

For  large  bandwidth  compression,  transform  block  classification  according 
to  some  generalized  i riteria  may  be  valuable  even  at  the  expense  of  in 
creased  distortion.  However,  it  must  be  remembered  that  both  theoretical 
and  practical  limits  for  an  image  compression  algorithm  with  negligible 
distortion  are  determined  by  the  entropy  measure.  A highly  adaptive  trans- 
form colling  technique  closely  approximates  the  actual  image  entropy. 

Future  algorithm  developments  in  compression  rate  reduction  are  not 
likely  to  be  dramatic.  Most  improvements  may  be  accomplished  by  the 
implementation  of  techniques  discussed  in  this  report.  Considerable  effort 
is  still  required  to  ensure  that  the  various  designs  are  implemented  through 
prnclunl  hardware. 
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In  this  chapter,  the  primary  algorithm  components  that  are  necessary 
for  implementation  arc  identified.  In  addition,  it  was  attempted  to  interface, 
at  least  conceptually,  some  of  these  components.  What  arc  these  component 
For  transform  coding,  a transform  algorithm  is  needed.  Prior  to  the  trans- 
form. a reformatting  butter  is  required.  An  important  new  component  was 
the  buffer  control  algorithm. 

Hardware  implementation  for  the  discussed  techniques  could  benefit 
front  a possible  design  that  is  based  on  independent  components.  These 
components  can  be  i tide  pendent  l \ developed  and  appropriately  interfaced  for 
a particular  transform  coding  algorithm.  This  modular  design,  in  addition 
to  transform  coding,  is  also  applicable  to  other  data  compression  techniques 
such  as  entropy  coding  through  a buffer  feedback  mechanism. 

This  author  believes  that  a well -designed  adaptive  transform  coding 
algorithm  represents  the  most  efficient  image  compression  technique  for 
high  quality  data.  Although  conceptual  design  configurations  discussed  in 
this  chapter  are  promising,  they  are  still  only  in  the  simulation  stage  and 
the  actual  hardware  design  is  yet  to  be  pursued. 


REFERENCES 


Ahmed,  N.  , Natarajan,  T.,  and  Rao,  K.  (1974),  "Discrete  Cosine 
Transform,  ' IEEE  Trans.  Comput.  , C - 2 3 , 90-9  3. 

Albert,  A.  (1972),  Regression  and  the  Moore-Penrose  Pseudoinverse, 
Academic  Press,  New  York. 

Anderson,  G.  B.  and  Huang,  T.  S.  (1971),  "Piecewise  Fourier 

Transformation  for  Picture  Bandwidth  Compression,  IEEE  T rans. 
Common.  Technol.  , COM-19,  1 33-  140. 

Andrews,  H.  C.  (1975),  "Two-Dimensional  Transforms,  ' in  Picture 
Processing  and  Digital  Filtering  (Topics  in  Applied  Physics, 

T.  S.  Huang,  ed.  ),  Vol.  6,  Springer- Verlag,  New  York. 

Andrews,  H.  C.  and  Patterson,  C.  L.  , III  (1976),  "Singular  Value 

Decompositions  and  Digital  Image  Processing,  ' IEEE  Trans.  Acoust.  , 
Speech,  and  Signal  Process.  , ASSP-  24,  26-53. 

Bracewell,  R.  (1965),  The  Fourier  Transform  and  Its  Applications, 
McGraw-Hill,  New  York. 

Buss,  D.  D.  , Brodersen,  R.  W.  , Hewes,  C.  R.  , and  Teasch,  A.  F.  , Jr. 

( 1975),  "Technology  of  Charge-Coupled  Devices  for  Video  Bandwidth 
Reduction.  " In  Tescher  (1975),  48-56. 

Chen,  W.  and  Smith,  C.  H.  (1976),  "Adaptive  Coding  of  Color  Images 

Using  Cosine  Transform,  " ICCC76,  IEEE  Catalog  Number  76CH1085-0 
CSCB , 47-7  through  47-13. 


105 


PHECED1NG  FA GE  hLAAK 


Coolev,  T.  W.  and  Tukey,  J.  W.  '965),  ’'An  Algorithm  for  the  Machine 

Calculation  of  Complex  Fourier  Series,  1 J.  Math.  Computation,  19, 
2^7. 10 1 . 

Cornsweet,  T.  N.  (1970),  Visual  Perception,  Academic  Press,  New  York. 

Cox,  R.  V,  and  Tescher,  A.  G.  (1976),  'Channel  Rate  Equalization 

Techniques  for  Adaptive1  Transform  Coders.  ' In  Tescher  (1976), 

2 19-240. 

Dail,  ,T.  E.  (19761,  'Polynomial  Transform  Coding  of  Image  Signals, 

ICC76,  IEEE  Catalog  Number  76CH  1084-0,  CSCB,  47-14  through 
47-19. 

Davisson,  L.  D.  (1972),  Rate  Distortion  Theory  and  Applications,  ’1 
Proc,  IEEE,  60,  800-808. 

Fellgett,  P.  N.  and  Linfoot,  E.  H.  ( 1955),  'On  the  Assessment  of 

Optical  Images,  1 Philos.  Trans.  R.  Soc.,  London,  Series  A,  247, 
369-407. 

Fink,  D.  G.  ( 1955),  Color  Television  Standards.  N.T.S.C.,  McGraw-Hill, 
New  York. 

Gallager,  R.  G.  (1968),  Information  Theory  and  Reliable  Communications, 
John  Wiley  and  Sons,  New  York. 

Goodman,  J.  W.  (1968),  Introduction  to  Fourier  Optics,  McGraw-Hill, 

New  York. 

Habibi,  A.  (1971),  ''Comparison  of  n-th  Order  DPCM  Encoder  with  Linear 
Transformations  and  Block  Quantization  Techniques,  IEEE  Trans. 
Comrnun.  Tcchnol.  , COM  - 1 9.  948-956. 


106 


107 


Jain,  A.  K.  (1976),  "Some  New  Techniques  in  Image  Processing.  " In 
Wilde  (1976),  201-223. 

Jones,  II.  W.,  Jr.  (1976),  'A  Real-Time  Adaptive  lladamard  Transform 
Video  Compression.  " In  Tcschcr  (1976),  2-9. 

Knauer,  S.  C.  ( 1975),  "Real-Time  Video  Compression  Algorithm  for 
lladamard  Transform  Processing,  " In  Tescher  (1975),  58-69. 

Landau,  II.  J.  and  Slepian,  D.  (1971),  "Some  Computer  Experiments  in 

Picture  Processing  for  Bandwidth  Reduction,  " Bell  System  Tech.  J.  , 
50,  1525-1540. 

Levi,  L.  (1970),  "On  Image  Evaluation  and  Enhancement,  " Optica  Acta, 

\1_,  59-76. 

Lukosz,  W.  (1962),  "Ubertragung  Nicht- Negative r Signale  Duzch  Linear 
Filter,  " Optica  Acta.  9,  335-364. 

Max,  J.  (I960),  "Quantizing  for  Linear  Distortions,  " IRE  Trans.  Info. 
Theory,  IT-6,  7-12. 

McCaughey,  D.  G.  (1976),  "Variable  Knot  Splines  as  an  Image  Analysis 
Technique , " In  Wilde  (1976),  168-175. 

Melzer,  S.  ( 1977),  "Image  Transform  Coding  with  Generalized  Correlation 
Models.  " In  "Sensor  Data  Characteristics  and  Data  Compression 
Projects  of  Future  Military  Satellite  Systems"  (R.  G.  Nishinaga,  Ed.), 
Tech.  Doc.  Rep.  No.  TOR- 0077  (2060 )- 1 , App.  C,  The  Aerospace 
Corporation,  El  Segundo,  Calif. 

O'Neal,  J.  B.  (1971),  "Entropy  Coding  in  Speech  and  Television  Differential 
PCM  Systems,  "IEEE  Trans.  Inf.  Theory,  66-7  1. 

Panther,  P.  F.  and  Ditc,  W.  (1951),  "Quantization  Distortion  in  Pulse-Code 
Modulation  with  Nonuniform  Spacing  of  Levels,  " Proc.  IRE,  39, 


108 


Papoulis,  A.  ( 1965),  P rob  ability  Random  Va  n aides  and  Stoehastie 
Pi'iu  i'ssi's,  MeCi  raw  - I li  1 1 , Ncu  York. 

IVarson,  D.  1*'.  ( 1 T ^ , 1'  ra  nsmi  s sum  and  Display  of  1 *i  i 1 o n a 1 1 nt  o final  i on  , 
John  Wiley  and  Sons,  Now  York. 

Pratt . \V.  , Chon,  \V  . , and  Welch,  1,.  (1974),  ’Slant  Transform  Image 
Coding,  1 KK  P Trans.  Common,  Technol.  , COM-22 , 1075_  109  3. 

Koador,  C.  (I'*?'"'),  'Intraframo  and  Intorframo  Adaptive  Transform 
Coding,  In  Tesche  r ( 1 7 S > , 108-1  IS. 

Kois,  J.  .1  . , l,yneh,  R.  I'.,  Butman,  J.  (ll)7t>),  'Adaptive  llaar  Transform 
Video  Bandwidth  Keduetion  System  for  RPV's.  In  1’escher  (B*7o), 
24-35, 

Roberts,  I,.  Ci.  ( 1 *■>  t>  J ) , "Pietnre  Coding  ('sing  Pseudo-  Random  Noise,'1 
1 R P Trans.  Info.  Theory,  IT  -8.  I4B-1B8. 

Roese,  J.  A.  and  Pratt,  \\  . K.  , ' "Theo reti eal  Performanee  Models  for 
Interframe  Transform  and  Hybrid  I ranst'orm/DPTM  Coders.  hi 
l eseher  ( P'7t>).  172-  179. 

Sihreiber,  \V,  P . (1067),  'Pietnre  Coding,’’  Hroc.  IPPP,  55,  120-330. 

Shannon,  ('.  P.  ( 1 4 8) , "A  Mathematical  Theory  of  Communi  ea  t urns  , 

Bell  System  Teeh.  J.,  27,  37b_42  3 and  62  3-656. 

Shannon,  C.  P.  and  Weaver,  \V  . (P'.pl),  The  Mathematieal  Theory  ot 
Commnnieations,  I'niversitv  of  Illinois  Press,  111. 

I'asto,  M.  and  Wintz,  P.  (1071),  Image  Coding  In'  Adaptive  Bloek 
Quantizations,  IPPP  Trans.  Common.  Teehnol.  , l OM-1'1, 


I'eschor,  A.  (i.  (10?  t),  1 The  Roll’  of  Phase  in  Adapt  i ve  lmap.o  t 'odi  njj , 

Phl>  Pi  s se  rtat  ion , P.  of  Southern  ( 'al  i form  a , PSCIl’  I Report  ‘'10. 


IVsv  Ium*.  A . l *• . 

(to?1'),  editor,  efficient  Transmission  of  Pii 

to  r i a 1 

Information,  Si'll  l’roc.  t't',  Rellingham,  W a slii  ngt  on. 

To  si  her,  A . ( I . 

( to;, A,  editor,  Vlvaiu  es  in  Image  Transmisi 

Sion 

Technigui 

•s,  SPIV'  Pror.  87,  Rellingham,  W a slii  ngt  on. 

T e si  lie  r , A . Ci . 

and  Andrews,  11.  (’.  (to?.)),  'The  Role  ot  Ad 

a p t i y e 

foiling  in  Two.  and  I'll  ivr  - Pi  monsion.il  Kounor  ,»n<l  Walsh  linage 
Compression,  11  in  Applications  of  Walsh  Functions  and  Segno m \ 
l'hoorv  (11.  Si  h r o iho  r a ml  (a.  !•'.  Sandv,  oilitors),  ll'I'l-'  Cal. 

No.  7-1(1 10 SO  t -Sl  'MC. 

Tescher,  A.  (i.  anil  ("os,  R.  V.  (107t'1,  "An  Adaptive  rr.insforin  Codinr. 
Algorithm,  1 lCC?(i,  ll'I'l'  Catalog  Nunihor  7ni'l  II  0S‘>  - 0 rSt'l', 

■17-20  through  47-2c'. 

Tescher,  A.  ( i . and  Co\,  R.  V . (l‘>77),  "Image  Coding:  V.in.iMi-  Rato 
lUH'M  through  Fi\oil  Rato  Channel.  " In  Tescher  (1°7  71. 

Whitohouso,  11.  .1..  Moans,  R.  W . , and  Wrench,  K.  11.  (ll)7M,  Real-Time 
I'olovision  Image  Randwidlh  Reduction  using  Charge  Iransfo  r Oovirrs. 
In  l'eschor  (|0?s)(  l(i-47. 

Wild*',  O.  C . and  Ra r r et  t , K . , edi  to r s ( I o 7 id , 1 ill. ice  Si" i cm  e M a t hemal  i o s , 
Western  Periodicals  Company,  North  Hollywood,  Calif. 

Wint/.,  P.  A.  (10721,  "Transform  Picture  Coding,"  Pror,  ll'I'l',  (>0, 
SOO-iS.’O. 


I I 0 


■M 


■ 


— 





